HEPATITIS C VIRUS EPITOPES

1. Field of Invention 

This invention relates to specific peptide viral antigens
which are immunoreactive with sera from patients infected
with parenterally transmitted non-A, non-B hepatitis virus
(PT-NANBH, now called Hepatitis C Virus), to polynucleotide
sequences which encode the peptides, to an expression system
capable of producing the peptides, and to methods of using
the peptides for detecting PT-NANBH infection in human sera.
3. Background

Viral hepatitis resulting from a virus other than hepatitis
A virus (HAV) and hepatitis B virus (HBV) has been referred
to as non-A, non-B hepatitis (NANBH). More recently, it has
become clear that NANBH encompasses at least two, and
perhaps more, quite distinct viruses. One of these, known as
enterically transmitted NANBH or ET-NANBH, is contracted
predominantly in poor-sanitation areas where food and
drinking water have been contaminated by fecal matter. The
molecular cloning of a portion of this virus, referred to as
the hepatitis E virus (HEV), has recently been described
(Reyes et al.).

The second NANB virus type, known as parenterally
transmitted NANBH, or PT-NANBH, is transmitted by parenteral
routes, typically by exposure to blood or blood products.
Approximately 10% of transfusions cause PT-NANBH infection,
and about half of these go on to a chronic disease state
(Dienstag).

Human sera documented as having produced post-transfusion
NANBH in human recipients have been used successfully to
produce PT-NANBH infection in chimpanzees (Bradley). RNA
isolated from infected chimpanzee sera has been used to
construct cDNA libraries in an expression vector for
immunoscreening with chronic-state human PT-NANBH serum.
This procedure identified a PT-NANBH specific cDNA clone,
and the viral sequence was then used as a probe to identify
fragments making up 7,300 contiguous basepairs of a PT-NANBH
viral agent (EPO patent application 88310922.5, filed
11/18/88). The same procedure was used by the present
inventors to derive two of the PT-NANBH peptide and
polynucleotide sequences disclosed herein. The sequenced
viral agent has been named hepatitis C virus (HCV) (above
EPO patent application).

Heretofore, one immunogenic peptide encoded by the HCV viral
agent has been reported (Choo, Kuo, EPO application
88310922.5). This peptide, designated C-100, has been used
in immunoassays of PT-NANBH sera and found to react
immunospecifically with up to 80% of chronic NANBH samples,
and about 15% of acute NANBH samples (Kuo).

It is desirable to provide one or a collection of peptide
antigens which are immunoreactive with a greater percentage
of PT-NANBH-infected blood, including both acute and chronic
PT-NANBH infection.

4. Summary of the Invention

It is one general object of the invention to provide
recombinant polypeptides immunoreactive with sera from
humans infected with hepatitis C virus (HCV), including a
peptide which is immunoreactive with a high percentage of
sera from chronic HCV-infected individuals, and peptides
which are immunoreactive with sera associated with acute
HCV infection.

It is another object of the invention to provide an HCV
polynucleotide sequence encoding a sequence for recombinant
production of the peptide antigens, and a diagnostic method
for detecting HCV-infected human sera using the peptide
antigens.
The invention includes, in one aspect, a peptide antigen
which is immunoreactive with sera from humans infected with
HCV. One peptide antigen in the invention includes an
immunoreactive portion of an HCV polypeptide which:

a) is encoded by an HCV coding sequence;

b) has 504 amino acid residues; and

c) has the carboxy-terminal sequence presented as SEQ
ID NO:4.

Other peptide antigens of the invention include an
immunoreactive portion of any one of the following
sequences: SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:10, SEQ ID
NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID
NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID NO:26.

In another aspect, the invention includes diagnostic kits
for use in screening human blood containing antibodies
specific against HCV infection. The kit includes at least
one peptide antigen which is immunoreactive with sera from
humans infected with hepatitis C virus (HCV). Specific
peptide antigens for use in the kit are given above.

One preferred embodiment of the present invention is a
diagnostic kit containing the 409-1-1 (c-a) peptide (SEQ ID
NO:8) and one of the HCV-capsid derived proteins (SEQ ID
NOs:12, 14, 16, 18, 20, 22, 24, and 26). Two particular
embodiments are 409-1-1 (c-a) with the C1NC450
capsid-derived peptide, and 409-1-1 (c-a) with the C1NC360
capsid-derived peptide.

In one embodiment of the present invention, the 
antigen is immobilized on a solid support. The binding of 
HCV-specific antibodies to the immobilized antigen is 
detected by a reporter-labeled anti-human antibody which 
acts to label the solid support with a detectable reporter. 
The kit is used in a method for detecting HCV infection in
an individual by: (i) reacting serum from an HCV-infected
test individual with the above peptide antigen, and (ii)
examining the antigen for the presence of bound antibody.

The peptide antigens are produced, in accordance with
another aspect of the invention, using an expression system
for expressing a recombinant peptide antigen which is
immunoreactive with sera from humans infected with hepatitis
C virus (HCV). A selected expression vector containing an
open reading frame (ORF) of a polynucleotide which encodes
the peptide is introduced into a suitable host, which is
cultured under conditions which promote expression of the
ORF in the expression vector.

In one embodiment, the polynucleotide is inserted into an
expression site in a lambda gt11 phage vector, and the
vector is introduced into an E. coli host. The following
E. coli hosts have been deposited which contain vectors
including the coding sequences of the antigens shown in
parentheses: ATCC No. 40901 (SEQ ID NO:3), ATCC No. 40893
(SEQ ID NO:1), ATCC No. 40792 (SEQ ID NO:7), and ATCC
No. 40876 (SEQ ID NO:9). pGEX and pET are two other vectors
which have been used to express HCV antigens. It will be
appreciated that determination of other appropriate vector
and host combinations for the expression of the above
sequences is within the ability of one of ordinary skill in
the art.

Also forming part of the invention are polynucleotides
which encode polypeptides immunoreactive with sera from
humans infected with hepatitis C virus (HCV). One
polynucleotide of the present invention encodes a
polypeptide wherein the polypeptide includes an
immunoreactive portion of a peptide sequence which:

a) is encoded by an HCV coding sequence;

b) has 504 amino acid residues; and

c) has the carboxy-terminal sequence presented as SEQ
ID NO:4; and, where the carboxy-terminal amino acid sequence
of said peptide antigen is encoded by the polynucleotide
sequence presented as SEQ ID NO:3.

Other polynucleotides of the invention include any one of
the following sequences: SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13,
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ
ID NO:23, and SEQ ID NO:25.

These and other objects and features of the invention will
become more fully apparent when the following detailed
description is read in conjunction with the accompanying
drawings.

5. Brief Description of the Drawings 

Figure 1 illustrates the steps in producing overlapping
linking fragments of a nucleic acid segment, in accordance
with the methods of the present invention.

Figure 2 shows the positions of overlap primer regions and
linking regions along a 7,300 basepair portion of the HCV
genome.

Figure 3 shows the DNA coding sequence of the clone 40
insert. The underlined sequences correspond to an R, primer
region.

Figure 4 shows the DNA coding sequence of a clone 36
insert. The underlined sequences correspond, respectively,
to the F_7, F_8, and R_8 primer regions.

Figure 5 shows the DNA and protein coding sequences for a
409-1-1 (abc) clone insert. The "A" region of this sequence
is delineated by boxes, the "B" region by a box and a
triangle, and the "C" region by a triangle and an asterisk.

Figure 6 shows the DNA and protein coding sequences for a
409-1-1 (c-a) clone insert.

Figure 7 illustrates the groups of clones which have 
been obtained from the HCV genome in the region correspond- 
ing to the 409-1-1 (abc) clone insert. 

Figure 8A shows the DNA and protein coding sequences for
the pGEX-GG1 insert. The three G's above the first line
indicate where substitutions were made to generate the
clone pGEX-CapA. Figure 8B shows the DNA and protein
sequences for the pGEX-CapA insert coding sequence. The
primers used in polymerase chain reactions to generate
carboxy and amino terminal deletions are indicated below
the nucleotide line. The sequences of the primers are
shown in the sense (coding strand) orientation. The actual
sequence of the NC (non-coding) primers is the reverse
complement of the indicated sequence. Coding primers are
underlined; reverse (noncoding) primers are
double-underlined. Sequences shown in capital letters are
exact matches. Sequences in lowercase letters are
"mismatched" sequences used to introduce the terminal
restriction sites (NcoI at the 5' ends and BamHI at the 3'
ends). The three nucleotides which have been altered to
remove the "slippery codons" at positions 24, 27, and 30
are indicated by bold type with the wild type A residues
shown above the sequence.
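
The primer convention of Figure 8B can be illustrated with a short sketch. It assumes only the stated rule that a non-coding primer is the reverse complement of the sense-strand sequence shown in the figure. The function name and the example sequence are hypothetical placeholders and are not taken from the figure; the lowercase bases stand for a BamHI site (GGATCC) appended at the 3' end.

```python
# Illustrative helper; the names and the example sequence are hypothetical.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, preserving letter case."""
    return seq.translate(_COMPLEMENT)[::-1]


# Placeholder primer region written in the sense orientation, as in the figure:
# uppercase = exact match to the template, lowercase = mismatched bases that
# introduce a restriction site at the 3' end of the amplified fragment.
sense_form = "TTAGCCATGggatcc"

# The oligonucleotide actually synthesized for a non-coding (NC) primer.
nc_primer = reverse_complement(sense_form)
print(nc_primer)
```

Because the case of each base is preserved, the mismatched restriction-site bases remain distinguishable in the synthesized primer, where they appear at its 5' end.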

Figure 9 shows a hydropathicity plot of the HCV-core
protein encoded by pGEX-CapA. The locations of the primers
used to generate carboxy and amino terminal deletions are
indicated relative to the protein coding sequence by arrows.

Figure 10 shows an epitope map of the HCV capsid 
protein region. 

6. Detailed Description of the Invention

I. Definitions 

The terms defined below have the following meaning herein:

1. "Parenterally transmitted non-A, non-B hepatitis viral
agent (PT-NANBH)" means a virus, virus type, or virus class
which (i) causes parenterally transmitted infectious
hepatitis, (ii) is transmissible in chimpanzees, and (iii)
is serologically distinct from hepatitis A virus (HAV),
hepatitis B virus (HBV), and hepatitis E virus (HEV).

2. "Hepatitis C virus (HCV)" means a PT-NANBH viral agent
whose polynucleotide sequence includes the sequence of the
7,300 basepair region of HCV given in the Appendix, and
variations of the sequence, such as degenerate codons, or
variations which may be present in different isolates or
strains of HCV.

3. Two nucleic acid fragments are "homologous" if they are
capable of hybridizing to one another under hybridization
conditions described in Maniatis et al., op. cit., pp.
320-323, using the following wash conditions: 2 x SSC, 0.1%
SDS, room temperature twice, 30 minutes each; then 2 x SSC,
0.1% SDS, 50°C once, 30 minutes; then 2 x SSC, room
temperature twice, 10 minutes each. Homologous sequences
can be identified that contain at most about 25-30%
basepair mismatches. More preferably, homologous nucleic
acid strands contain 15-25% basepair mismatches, even more
preferably 5-15% basepair mismatches. These degrees of
homology can be selected by using more stringent wash or
hybridization conditions for identification of clones from
gene libraries (or other sources of genetic material), as
is well known in the art.
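
The mismatch ranges in definition 3 can be illustrated numerically. The sketch below is not the hybridization test itself, which is defined experimentally by the wash conditions above. It assumes two already-aligned sequences of equal length and counts positional mismatches only. The function names, the tier labels, and the treatment of sequences with fewer than 5% mismatches as belonging to the most stringent tier are assumptions made for illustration.

```python
def percent_mismatch(a: str, b: str) -> float:
    """Percentage of positions that differ between two aligned, equal-length sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(1 for x, y in zip(a.upper(), b.upper()) if x != y)
    return 100.0 * mismatches / len(a)


def homology_tier(pct: float) -> str:
    """Map a mismatch percentage onto the ranges named in definition 3 (labels are ours)."""
    if pct <= 15:
        return "even more preferred (up to 15% mismatches)"
    if pct <= 25:
        return "more preferred (15-25% mismatches)"
    if pct <= 30:
        return "homologous (at most about 25-30% mismatches)"
    return "outside the stated homology range"


# Toy aligned sequences: one mismatch in ten positions.
pct = percent_mismatch("ACGTACGTAC", "ACGTTCGTAC")
print(pct, homology_tier(pct))
```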

4. A DNA fragment is "derived from" HCV if it has 
substantially the same basepair sequence as a region of the 
HCV viral genome which was defined in (2) above. 

5. A protein is "derived from" a PT-NANBH or HCV viral
agent if it is encoded by an open reading frame of a cDNA
or RNA fragment derived from a PT-NANBH or HCV viral agent,
respectively.

II. Molecular Clone Selection by Immunoscreening

As one approach toward identifying a molecular clone of a
PT-NANBH agent, cDNA libraries are prepared from infected
sera in the expression vector lambda gt11. cDNA sequences
are then selected for expression of peptides which are
immunoreactive with PT-NANBH-infected sera. Recombinant
proteins identified by this approach provide candidates for
peptides which can serve as substrates in diagnostic tests.
Further, the nucleic acid coding sequences identified by
this approach serve as useful hybridization probes for the
identification of further PT-NANBH coding sequences.

In order to make immunoscreening a useful approach for
identifying clones originating from PT-NANBH coding
sequences, a well-defined source of PT-NANBH virus is
important. To generate such a source, a chimpanzee (#771;
Example 1A) was infected with transmissible PT-NANBH agents
using a Factor VIII concentrate as a source (Bradley). The
Factor VIII concentrate was known to contain at least two
forms of parenterally transmitted NANB hepatitis
(PT-NANBH). In addition to a chloroform-sensitive agent,
which has subsequently been called hepatitis C virus (HCV),
a chloroform-resistant form of PT-NANBH was also
transmitted in the concentrate (Bradley, 1983).

In the method illustrated in Example 1, infected serum was
pelleted, without dilution, by centrifugation, and cDNA
libraries were generated from the resulting pelleted virus
(Examples 1B and 1C). Sera from infected human sources were
treated in the same fashion. cDNA libraries were generated,
e.g., by a random primer method using the RNA extracted
from pelleted sera as starting material (Examples 1B and
1C). The resulting cDNA molecules were then cloned into a
suitable vector, for example, lambda gt11, for expression
and screening of peptide antigens, and lambda gt10, for
hybridization screening (Example 1C(iv)). Lambda gt11 is a
particularly useful expression vector which contains a
unique EcoRI insertion site 53 base pairs upstream of the
translation termination codon of the beta-galactosidase
gene. Thus, an inserted sequence is expressed as a
beta-galactosidase fusion protein which contains the
N-terminal portion of the beta-galactosidase gene product,
the heterologous peptide, and optionally the C-terminal
region of the beta-galactosidase peptide (the C-terminal
portion being expressed when the heterologous peptide
coding sequence does not contain a translation termination
codon). This vector also produces a temperature-sensitive
repressor (cI857) which causes viral lysogeny at permissive
temperatures, e.g., 32°C, and leads to viral lysis at
elevated temperatures, e.g., 42°C. Advantages of this
vector include: (1) highly efficient recombinant clone
generation, (2) ability to select lysogenized host cells on
the basis of host-cell growth at permissive, but not
non-permissive, temperatures, and (3) high levels of
recombinant fusion protein production. Further, since phage
containing a heterologous insert produces an inactive
beta-galactosidase enzyme, phage with inserts are typically
identified using a beta-galactosidase colored-substrate
reaction.
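
The condition under which the C-terminal beta-galactosidase region is expressed can be sketched as a simple check on the insert sequence. The sketch assumes that the insert is read in frame from its first base and uses the standard genetic code; it does not model the vector itself. The function names and the toy insert sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 0; '*' marks a stop, 'X' an unknown codon."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


def reads_through(insert: str) -> bool:
    """True if the insert has no in-frame stop codon, so translation can
    continue into the downstream beta-galactosidase sequence."""
    return "*" not in translate(insert)


# Toy inserts (not HCV sequences): the second carries an in-frame TAG stop.
print(translate("ATGGCTTGGAAA"), reads_through("ATGGCTTGGAAA"))
print(translate("ATGGCTTAGAAA"), reads_through("ATGGCTTAGAAA"))
```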

In the screening procedure reported in Examples 1-3,
individual cDNA libraries were prepared from the serum of
one PT-NANBH infected chimpanzee (#771) and four PT-NANBH
infected humans (designated EGM, BV, WEH, and AG). These
five libraries were immunoscreened using PT-NANBH positive
human or chimpanzee sera (Example 2): 111 lambda gt11
clones were identified which were immunoreactive with at
least one of the sera. Of these 111 clones, 93 were
examined for insert hybridization with normal DNA. The
inserts were radioactively labelled and used as probes
against HindIII/EcoRI doubly-digested human peripheral
blood lymphocyte (PBL) DNA (Example 3). Approximately 46%
(43/93) of the inserts hybridized with normal human PBL DNA
and were therefore not pursued. Inserts from 11
PT-NANBH-immunopositive clones derived from chimpanzee #771
sera were characterized as exogenous to normal human PBL
DNA (Example 3). Of these 11 clones, two PT-NANBH clones
were identified having the following characteristics. One
clone (clone 40) was clearly exogenous by repeated
hybridization tests against normal human PBL DNA, had a
relatively small insert size (approximately 0.5 kilobases),
and was quite unreactive with negative control serum. The
second clone (clone 36) was shown to be reactive with
multiple PT-NANBH antisera, had a relatively large insert
size (approximately 1.5 kilobases), and was exogenous by
hybridization testing against normal human PBL DNA. The
immunoreactive characteristics of clones 36 and 40 are
summarized in Table 1 (Example 3). Clone 36 was
immunoreactive with chimpanzee #771 sera and two
HCV-positive human sera, AG and BV. The clone 36 antigen
did not immunoreact with the negative control sera SKF.
Clone 40 was immunoreactive with chimpanzee #771 sera and
was cleanly nonreactive when the negative control sera was
used for screening.

The DNA sequence of clone 36 was determined in part and is
shown in Figure 4. This sequence corresponds to nucleotides
5010 to 6516 of the HCV sequence given in the Appendix. The
DNA sequence was also determined for the clone 40 insert
(Figure 3). This sequence is homologous to the HCV sequence
(Appendix) in the region of approximately nucleotides 6515
to 7070. The inserts of two other chimpanzee #771 clones,
clones 44 and 45, were found to be homologous to clone 40
by hybridization and sequence analysis (Example 4). The
sequences for clones 36 and 40 are contiguous sequences,
with the clone 36 sequences being located 5' of the clone
40 sequences as presented in the Appendix. Accordingly,
these two clones represent isolation of a significant block
of the HCV genome by the above-described immunoscreening
methods.
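
The coordinates quoted above can be checked against the insert sizes reported earlier (approximately 1.5 and 0.5 kilobases). The sketch below assumes 1-based, inclusive nucleotide positions and takes the approximate clone 40 coordinates at face value; the class and function names are illustrative only.

```python
from typing import NamedTuple


class Region(NamedTuple):
    name: str
    start: int  # 1-based, inclusive position in the HCV sequence of the Appendix
    end: int    # inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def shared_bases(a: Region, b: Region) -> int:
    """Number of positions covered by both regions (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


clone36 = Region("clone 36", 5010, 6516)
clone40 = Region("clone 40", 6515, 7070)  # coordinates stated as approximate

print(clone36.length)                   # about 1.5 kb
print(clone40.length)                   # about 0.5 kb
print(shared_bases(clone36, clone40))   # junction between the two inserts
print(clone40.end - clone36.start + 1)  # combined block of the genome
```

Under these assumptions the two inserts meet at positions 6515-6516 and together cover roughly 2 kilobases of the 7,300 basepair sequence.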

The four lambda gt11 clones 36, 40, 44, and 45 were
deposited in the Genelabs Culture Collection, Genelabs
Incorporated, 505 Penobscot Drive, Redwood City, CA 94063.
Further, the lambda gt11 clones of clones 36 and 40 were
deposited with the American Type Culture Collection, 12301
Parklawn Dr., Rockville MD, 20852, and given the deposit
numbers ATCC No. 40901 and ATCC No. 40893.

III. PT-NANBH Sequence Identification by Hybridization
Methods
The polynucleotides identified in Section II can be
employed as probes in hybridization methods to identify 
further HCV sequences, and these can then be used as probes 
to identify additional sequences. The polynucleotides can 
be directly cloned or fragmented by partial digestion to 
generate random fragments. The resulting clones can be 
immunoscreened as described above to identify HCV antigen 
coding sequences. 

To illustrate how the inserts of clones 36 and 40 can be
used to identify clones carrying HCV sequences, the insert
of clone 40 was isolated and used as a hybridization probe
against the individual cDNA libraries established in lambda
gt10 (see above). Using the clone 40 probe, approximately
24 independent hybridization-positive clones were plaque
purified (Example 5). The positive signals arose with
different frequencies in cDNA libraries from the different
serum sources, suggesting that the hybridization signals
were from the serum sources, rather than resulting from
some common contaminant introduced during the cDNA
synthesis or cloning (Table 2). One of the clones, 108-2-5,
which tested positive by hybridization with the clone 40
insert, had an insert of approximately 3.7 kb (Example 6).
Since it had such a large insert, clone 108-2-5 was chosen
for further analysis. The serum source of this cDNA clone
was EGM human PT-NANBH serum (Example 1).

The insert of 108-2-5 was isolated by EcoRI digestion of
the lambda gt10 clone, electrophoretic fractionation, and
electroelution (Example 6). The isolated insert was treated
with DNase I to generate random fragments (Example 6), and
the resulting digest fragments were inserted into lambda
gt11 phage vectors for immunoscreening. The lambda gt11
clones of the 108-2-5 fragments were immunoscreened
(Example 6) using human (BV and normal) and chimpanzee #771
serum. Twelve positive clones were identified by first
round immunoscreening with the human and chimp sera. Seven
of the 12 clones were plaque purified and rescreened using
chimpanzee #771 serum. Partial DNA sequences of the insert
DNA were determined for two of the resulting clones,
designated 328-16-1 and 328-16-2. These two clones
contained sequences essentially identical to clone 40
(Example 6).

The clone 36 insert can be used in a similar manner to
probe the original cDNA library generated in lambda gt10.
Specific subfragments of clone 36 may be isolated by
polymerase chain reaction or after cleavage with
restriction endonucleases. These fragments can be
radioactively labelled and used as probes against the cDNA
libraries generated in lambda gt10 (Example 1C). In
particular, the 5' terminal sequences of the clone 36
insert are useful as probes to identify clones overlapping
this region.

Further, the sequences provided by the terminal clone 36
insert sequences and the terminal clone 40 insert sequences
are useful as specific sequence primers in first-strand DNA
synthesis reactions (Maniatis et al.; Scharf et al.) using,
for example, RNA generated from chimpanzee #771 sera as
substrate. Synthesis of the second strand of the cDNA is
randomly primed. The above procedures identify or produce
cDNA molecules corresponding to nucleic acid regions that
are 5' adjacent to the known clone 36 and 40 insert
sequences. These newly isolated sequences can in turn be
used to identify further flanking sequences, and so on, to
identify the sequences composing the HCV genome. As
described above, after new HCV sequences are isolated, the
polynucleotides can be cloned and immunoscreened to
identify specific sequences encoding HCV antigens.

IV. Generating Overlapping Cloned Linking Fragments

This section describes a method for producing and 
identifying HCV peptides which may be useful as HCV- 
diagnostic antigens. The present method is used to 
generate a series of overlapping linking fragments which 
span a segment of nucleic acid. The application of the 
method to generating a series of overlapping linking frag- 
ments which span a 7,300 basepair segment of the HCV 
genome, whose sequence is given in the Appendix, will be 
described with reference to Figures 1 and 2. 

As a first step in the method, and with reference to 
Figure 1, the nucleic acid of interest is obtained in 
double-strand DNA form. Typically, this is done by 
isolating genomic DNA fragments or by producing cDNAs from 
RNA species present in a sample fluid. The latter method 
is used to generate double-strand DNA from NANBH viral RNA 
present in serum from chimpanzees or humans with known
PT-NANBH infection. Here, RNA in the sample is isolated, e.g.,
by guanidinium thiocyanate extraction of PEG precipitated 
virions, and reacted with a suitable primer for first 
strand cDNA synthesis. 

First-strand cDNA priming may be by random primers, oligo
dT primers, or sequence-specific primer(s). The primer
conditions are selected to (a) optimize generation of cDNA
fragments which collectively will span the nucleic acid
segment of interest, and (b) produce cDNA fragments which
are preferably equal to or greater than about 1,000
basepairs in length. In one method applied to HCV RNA, the
first-strand synthesis is carried out using
sequence-specific primers which are complementary to spaced
regions along the length of the known HCV genomic sequence.
The primer positions are indicated at A, B, C, and D in
Figure 2, which shows a map of the HCV genome segment. The
basepair locations of the primers in the HCV genome are
given in Example 7 below. Following first strand synthesis,
the second cDNA strand is synthesized by standard methods.

The linking fragments in the method are produced by
sequence-specific amplification of the double-strand DNA
obtained as above, using pairs of overlap-region primers to
be described. According to an important advantage of the
methods of the present invention, it is possible to
generate linking fragments even when the amount of
double-strand DNA is too low for direct sequence-specific
amplification. This limitation was found, for example, with
HCV cDNAs produced from NANBH-infected serum. Here the
double-stranded DNA available for amplification is first
amplified nonspecifically by a technique known as
Sequence-Independent Single-Primer Amplification (SISPA).

The SISPA technique is detailed in co-owned U.S. Patent
application for "RNA and DNA Amplification Techniques",
Serial No. 224,961, filed July 26, 1988. The method as
applied to amplification of HCV cDNA fragments is also
described in Example 7. Briefly, known-sequence linker
primers are attached to opposite ends of double-stranded
DNA in a DNA sample. These linkers then provide the common
end sequences for primer-initiated amplification, using
primers complementary to the linker/primer sequences.
Typically, the SISPA method is carried out for 20-30 cycles
of amplification, using thermal cycling to achieve
successive denaturation and primer-initiated polymerization
of second strand DNA.
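
The effect of 20-30 amplification cycles can be estimated with a simple model. The sketch below assumes idealized exponential amplification in which each cycle multiplies the template by (1 + efficiency); the efficiency parameter is an assumption introduced here for illustration and is not a value given in this description. Real reactions typically fall short of this ideal as reagents are depleted.

```python
def fold_amplification(cycles: int, efficiency: float = 1.0) -> float:
    """Idealized fold increase in template after a number of thermal cycles.

    efficiency = 1.0 means perfect doubling per cycle (an assumption).
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return (1.0 + efficiency) ** cycles


for n in (20, 25, 30):
    print(n, f"{fold_amplification(n):.3g}", f"{fold_amplification(n, 0.8):.3g}")
```

Under perfect doubling, 20 cycles give about a million-fold increase and 30 cycles about a billion-fold increase, which indicates why a nonspecific pre-amplification can make scarce cDNA usable for subsequent sequence-specific amplification.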

Figure 1 illustrates the SISPA amplification of duplex DNA,
to form amplified fragments which have known-sequence
regions P_i. As seen, the fragment mixture includes at
least some fragments which (a) overlap at regions P_i with
other fragments in the mixture and (b) contain complete
linking regions between adjacent P_i and P_i+1 regions.
Collectively, each linking region bounded by the associated
overlap regions making up the segment is present in at
least one DNA fragment.

The production of overlapping linking fragments, in
accordance with the methods of the present invention, is
carried out using the polymerase chain reaction (PCR)
method described in U.S. Patent No. 4,683,195. In
practicing this step of the method, first the total segment
of interest is divided into a series of overlapping
intervals bounded by regions of known sequence, as just
described. In Figure 2, the 7,300 basepair segment of the
HCV genome has been divided into ten intervals, each about
500-1,000 basepairs in length. The intervals are designated
according to the forward F_i and reverse R_j primers used
in amplifying the sequence, as will be described. The
selection of the intervals is guided by (a) the requirement
that the basepair sequence at each end of the interval be
known, and (b) a preferred interval length of between about
500 and 2,000 basepairs.
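
The division of a segment into overlapping intervals can be sketched as follows. The sketch assumes evenly spaced overlap regions of about 200 basepairs, which is a simplification: the actual primer positions for the HCV segment are those given in the Examples and are not reproduced here. The function name and its parameters are illustrative.

```python
def plan_intervals(segment_len: int = 7300, n_intervals: int = 10, overlap_len: int = 200):
    """Return (overlap_regions, intervals) as lists of 1-based inclusive (start, end).

    n_intervals + 1 overlap regions are spaced evenly along the segment; each
    interval runs from the start of one overlap region to the end of the next,
    so that adjacent intervals share one complete overlap region.
    """
    step = (segment_len - overlap_len) / n_intervals
    overlaps = []
    for k in range(n_intervals + 1):
        start = round(k * step) + 1
        overlaps.append((start, start + overlap_len - 1))
    intervals = [(overlaps[k][0], overlaps[k + 1][1]) for k in range(n_intervals)]
    return overlaps, intervals


overlaps, intervals = plan_intervals()
for k, (start, end) in enumerate(intervals, start=1):
    print(f"interval {k}: {start}-{end} ({end - start + 1} bp)")
```

With these assumed values, ten intervals require eleven overlap regions, and each interval is 910 basepairs long, within the preferred range of about 500 to 2,000 basepairs.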

In the method applied to the 7,300 basepair segment of the
HCV genome, the regions of overlap between the ten
intervals were additionally amplified, to verify that the
SISPA-amplified cDNA sample contained sufficient HCV cDNA
to observe PCR-amplification of HCV linking fragments, and
that HCV regions along the entire length of the genome were
available for amplification. Each overlap region in the
segment can be defined by a pair of primers which includes
a forward primer F_j and a reverse primer R_j which are
complementary to opposite strands of opposite ends of the
overlap region. The primers are typically about 20
basepairs in length and span an overlap region of about 200
basepairs. The eleven overlap regions in the HCV segment
and the regions corresponding to the forward and reverse
primers in each region are given in Example 8.

The primers F_j/R_j are added to the amplified DNA material
in a PCR reaction mix, and the overlap region bounded by
the primers is amplified by 20-30 thermal cycles. The
reaction material is then fractionated, e.g., by agarose
gel electrophoresis, and probed for the presence of the
desired sequence, e.g., by Southern blotting (Southern),
using a radiolabeled oligonucleotide probe which is
specific for an internal portion of the overlap region. As
described in Example 8, this method was successful in
producing amplified fragments for each of the eleven
F_j/R_j overlap regions in the HCV genome segment. The
overlap-region fragments may be used as probes for the
corresponding (two) linking fragments connected by the
overlap region. It is emphasized, however, that this
amplification step was employed to confirm the presence of
amplifiable cDNA along the length of the HCV genome, and
not as an essential step in producing the desired linking
fragments. The step is omitted from Figure 1.

The linking fragments F_i/R_j are produced by a two-primer
PCR procedure in which the SISPA-amplified DNA fragments
are amplified by a primer pair consisting of the forward
primer F_i of one overlap region and the reverse primer R_j
of an adjacent overlap region. The ten linking fragments in
the HCV segment and the regions corresponding to the
forward and reverse primers for each fragment are given in
Example 9. Typical amplification conditions are given in
Example 9. The amplified fragments in each reaction mixture
are isolated and purified, e.g., by gel electrophoresis, to
confirm the expected fragment size. Southern blots may be
probed with oligonucleotide probes complementary to
internal regions located between the fragment ends, to
confirm the expected sequence of the fragments. As shown at
the bottom in Figure 1, the method generates the complete
set of linking fragments, where each fragment is bounded by
overlap regions P_i and P_i+1.

The method, as applied to generating ten overlapping
linking fragments of the 7,300 basepair HCV genome, is
described in Example 9. As demonstrated by size criteria on
gel electrophoresis and by sequence criteria by Southern
blotting, the method was successful in generating all ten
of the overlapping fragments spanning the HCV genome.
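
The pairing of a forward primer from one overlap region with the reverse primer of the adjacent overlap region can be illustrated with a toy in-silico amplification. The sketch assumes exact primer matches on a single template strand and uses invented sequences; the overlap regions, linkers, primers, and function names are hypothetical and do not correspond to the HCV primers of the Examples.

```python
_COMP = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMP)[::-1]


def amplify(template: str, forward: str, reverse: str):
    """Return the product bounded by the two primers, or None if either fails to match.

    `forward` matches the template strand directly; `reverse` is the actual
    reverse primer, so its reverse complement is what appears in the template.
    """
    start = template.find(forward)
    if start == -1:
        return None
    site = template.find(revcomp(reverse), start + len(forward))
    if site == -1:
        return None
    return template[start:site + len(reverse)]


# Toy template: three overlap regions (P) separated by two linking regions.
P = ["ATGGCGTACC", "GGATTCAGCA", "TTCCGAGGTA"]
LINK = ["AAAAAAAA", "CCCCCCCC"]
template = P[0] + LINK[0] + P[1] + LINK[1] + P[2]

F = [p[:5] for p in P]            # forward primer at the start of each overlap region
R = [revcomp(p[-5:]) for p in P]  # reverse primer at the end of each overlap region

# Linking fragment i uses F of region i and R of region i+1.
fragments = [amplify(template, F[i], R[i + 1]) for i in range(len(P) - 1)]
for frag in fragments:
    print(frag)
```

In this toy case the two products share the middle overlap region, which is the property that allows adjacent linking fragments to be ordered along the segment.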
It will be appreciated that the above flanking sequence
amplification method can be applied to the generation of
DNA fragments corresponding to the insert sequences of
clones 36 and 40, which have also been obtained by
immunoscreening. The linker primers flanking the inserts
are easily used to generate sequences corresponding to the
clone inserts. For example, two-primer amplification of the
SISPA-amplified cDNA fragments (Example 7) using the
F_12/R_9 primer pair (the sequences of which are given in
Example 8) is carried out under conditions similar to those
described in Example 9. The amplified fragment mixture is
fractionated by agarose electrophoresis on 1.0% agarose,
and the expected band cut from the gel and eluted.

The purified amplified fragment is treated with the
Klenow fragment of DNA polymerase I to assure the molecules
are blunt-ended. The fragment is then ligated to EcoRI
linkers (Example 10). The mixture is digested with EcoRI
and inserted into the lambda gt11 vector. The resulting
clones contain the entire coding sequences of either the
clone 36 or clone 40 inserts.

Alternatively, the original amplified 36/40 fragment
(primers F12/R9) is briefly treated with Exonuclease III
(Boehringer Mannheim, as per manufacturer's instructions)
to generate a family of fragments with different 5' ends.
The digestion products are treated as above and ligated
into the lambda gt11 vector. The resulting plaques are
then immunoscreened.

Further, different sets of primers, other than the
F12/R9 primers described above, can be used to directly
generate sequences encoding all, or portions, of clones 36
and 40. For example, primers Fg/R, can generate a fragment
corresponding to a portion of the 3' sequences of the
insert of clone 36 (Figure 4) and all of the insert
sequences of clone 40 (Figure 3). Also, primers F 7 /R g can
be used to directly generate a fragment corresponding to a
portion of the 5' sequences present in the insert of clone
36 (Figure 4).

V. PT-NANBH Immunoreactive Peptide Fragments 

Several novel peptide antigens which are immunoreactive
with sera from NANBH-infected humans and chimpanzees
have been generated from the NANBH linking fragments
produced above, in accordance with the methods of the
present invention. Further, this method has confirmed
antigenic regions previously identified by cDNA library
immunoscreening (Section II above). The antigen peptides
derived from linking fragments are preferably produced by
a method which involves first digesting each of the above
linking fragments with DNase I under partial digestion
conditions, yielding DNA digest fragments predominantly in
the 100-300 basepair size range, as illustrated in Example
10. The digest fragments may be size fractionated, for
example by gel electrophoresis, to select those in the
desired size range.
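As a rough illustration of the size-selection step, the following sketch filters a list of fragment lengths to the 100-300 basepair range and reports the largest peptide each selected fragment could encode. The fragment lengths and function names are hypothetical and serve only to show the arithmetic.

```python
def select_by_size(lengths_bp, low=100, high=300):
    """Keep only fragment lengths within the desired size range (inclusive)."""
    return [n for n in lengths_bp if low <= n <= high]

def max_codons(length_bp):
    """Largest number of complete codons a fragment of this length can hold."""
    return length_bp // 3

# Hypothetical digest-fragment lengths in basepairs (illustrative only).
digest_lengths = [45, 120, 260, 310, 900]

selected = select_by_size(digest_lengths)
print(selected)
print([max_codons(n) for n in selected])
```

On this arithmetic, fragments of 100-300 basepairs correspond to peptides of at most about 33-100 amino acid residues.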

The digest fragments from each linking fragment are
then inserted into a suitable expression vector. One
exemplary expression vector is lambda gt11, the advantages
of which have been described above.

For insertion into the expression vector, the digest
fragments may be modified, if needed, to contain selected
restriction-site linkers, such as EcoRI linkers, according
to conventional procedures. Typically, the digest fragments
are blunt-ended, ligated with EcoRI linkers, and
introduced into EcoRI-cut lambda gt11. Such recombinant
techniques are well known in the art (e.g., Maniatis et
al.).

The resulting viral genomic library may be checked to
confirm that a relatively large (representative) library
has been produced for each linking fragment. This can be
done, in the case of the lambda gt11 vector, by infecting
a suitable bacterial host, plating the bacteria, and
examining the plaques for loss of beta-galactosidase activity,
as evidenced by clear plaques.

The presence of a digest-fragment insert in the clear
plaques can be confirmed by amplifying the phage DNA, using
primers specific for the regions of the gt11 phage flanking
the EcoRI insert site, as described in Example 10B. The
results in Table 3 show that a large percentage of the
plaques tested in each linking fragment library contained
a digest-fragment insert.

The linking-fragment libraries may also be screened
for peptide antigens which are immunoreactive with human or
chimpanzee sera identified with PT-NANBH chronic,
convalescent, or acute infection. One preferred immunoscreening
method is described in Example 10B. Here recombinant
protein produced by the phage-infected bacteria is transferred
from the plaques to the filter. After washing, the filter
is incubated with test serum, and then reacted with
reporter-labeled anti-human IgG antibody. The presence of the
peptide antigen on the filter is then determined by assaying
for the presence of the reporter. As seen from Table 3,
several of the linking-fragment libraries were positive for
immunoreactive peptides in the primary screen.

The immunoscreening method just described can be used
to identify library plaques from each of the linking
libraries which are immunoreactive with sera from humans or
chimpanzees with known chronic, convalescent, or acute PT-NANBH
infection. One exemplary screening procedure is given in
Example 11, where the ten HCV linking-fragment libraries
are screened with known PT-NANBH (a) human chronic serum,
(b) chimpanzee acute pooled sera and (c) chimpanzee chronic
pooled sera. Of the ten libraries examined, only the F,/R, 0
library did not give positive immunoreaction with any of
the three sera. Several of the fragment libraries, including
Fj/R,, F 6 /R„, F„/R 7 , and F 7 /R 8, showed five or more
positive reactions with chimpanzee acute sera, indicating
that these libraries each express one or more peptide
antigens which are useful for detecting acute PT-NANBH
infection.

The [illegible] linking fragment corresponds to an internal
fragment of the clone 36 insert (Section II; Figure 4).
Accordingly, the linking fragment method confirmed that this
region encodes a useful antigen. Further, the fragment
library F./R, contains the sequences present in the clone 40
insert (Section II; Figures 3 and 4). The results in Table
4 indicate that one peptide antigen effective to detect
[illegible] serum was isolated from the F./R, fragment
library.

A. Immunoreactive Screening 

Two of the immunoreactive plaques identified by
immunoreactive screening, designated 409-1-1 (abc) and
409-1-1 (c-a), were tested for immunoreactivity against
well-documented PT-NANBH chronic sera which showed strong
immunoreactivity to the 5-1-1 HCV peptide antigen. The
5-1-1 antigen has previously been identified as
immunoreactive against a high percentage of PT-NANBH chronic
sera. The 5-1-1 antigen is encoded by the sequence between
basepairs 3731 and 3857 in the HCV genome, and is contained
in a larger peptide antigen (C-100) encoded by the sequence
between basepairs 3531 and 4442. The latter peptide is
employed in a commercial diagnostic kit for detection of
human HCV infection (Ortho/Chiron). The kit is reported to
react positively with about [illegible]% of human chronic
PT-NANBH sera, and about 15% of human acute PT-NANBH sera,
as noted above.

The 409-1-1 (c-a) phage was identified by screening
and plaque purified, as outlined above. A
related clone, designated 409-1-1 (abc), was described in
the parent to the present application (U.S. Application
Ser. No. 07/505,611, herein incorporated by reference).
Clone 409-1-1 (abc) was designated 409-1-1 in the parent
application. The a, b and c designations refer to three
regions of the 409-1-1 (abc) sequence (see Figure 5). The
5-1-1 coding sequence was isolated by polymerase chain
reaction using oligonucleotide primers complementary to the
ends of the 5-1-1 coding region, and cloned into lambda
gt11, for expression under induction conditions of a fused
beta-galactosidase protein which includes the 5-1-1 antigen
peptide region. The 5-1-1 phage was identified and plaque
purified by similar methods.

The 409-1-1 (c-a) and 5-1-1 antigens were compared by
plaque immunoscreening with a panel of 28 sera from human
normal (2 donors), human PT-NANBH-chronic (6 donors),
chimpanzee normal (7 donors), chimpanzee PT-NANBH-acute
(5 donors), and chimpanzee PT-NANBH-chronic (8 donors), with
the results shown in Table 5 in Example 12. As can be seen in
Table 5, the 5-1-1 and 409-1-1 (c-a) peptides reacted with
most of the human and chimpanzee chronic sera, although the
409-1-1 (c-a) peptide detected a higher percentage of human
chronic sera samples (83% vs 66%). The chronic human serum
which was detected by the 409-1-1 (c-a) peptide, but not by
5-1-1, was from a patient (BV) who died of fulminant NANBH
infection. Because the 5-1-1 antigen is contained within
the C-100 antigen in the commercially available kit format
(Ortho/Chiron), it was of interest to determine whether the
C-100 antigen gave a broader range of reactivity with the
test sera. The results are shown at the right in Table 5
below. The only human NANBH serum that was tested was the
above BV serum which was not detected by 5-1-1. This serum



WO 91/15516 



PCT/US91/02370 



was also not immunoreactive with the C-100 antigen (0/1).
Nor was the C-100 antigen reactive with any of the five
acute chimp sera which were tested (0/5). It is also noted
that the 409-1-1 (c-a) antigen is immunoreactive with 3 of
the 5 acute chimpanzee sera tested, compared with only 1
out of 5 for the 5-1-1 antigen. The results indicate that
the 409-1-1 (c-a) antigen has broader immunospecificity with
PT-NANBH sera, and thus would provide a superior diagnostic
agent. The results obtained with 409-1-1 (c-a) are comparable
to the results obtained using 409-1-1 (abc).
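The comparison above reduces to simple proportions, sketched below. The acute chimpanzee counts (3/5, 1/5, 0/5) are stated in the text; the human chronic counts of 5/6 and 4/6 are an inference from the quoted 83% and 66% for six donors, and are not read from Table 5.

```python
def percent(positive, total):
    """Detection rate as a percentage of sera tested."""
    return 100 * positive / total

# (positive, total) per antigen. Human chronic counts are inferred from the
# quoted percentages (assumption); chimpanzee acute counts are as stated.
human_chronic = {"409-1-1(c-a)": (5, 6), "5-1-1": (4, 6)}
chimp_acute = {"409-1-1(c-a)": (3, 5), "5-1-1": (1, 5), "C-100": (0, 5)}

for panel_name, panel in (("human chronic", human_chronic),
                          ("chimpanzee acute", chimp_acute)):
    for antigen, (pos, total) in panel.items():
        # int() truncates, which reproduces the 66% quoted for 4 of 6.
        print(panel_name, antigen, f"{pos}/{total}", int(percent(pos, total)), "%")
```

Note that 4/6 is 66.7%, so the quoted 66% appears to be a truncated rather than a rounded value.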

It is noted here that the 409-1-1 (abc) coding sequence 
is contained in the p 4 /b, linking fragment and does not 
overlap the sequence of the C-100 (and 5-1-1) coding region 
which is in the F 4 /Rj and F 5 /R< linking fragments. The 
relatively long coding sequence of the 409-1-1 (abc) peptide 
illustrates that larger size digest fragments (substantial- 
ly greater than 300 basepairs) are generated in the partial 
digest step used in producing digest fragments for antigen 
expression. 

The 409-1-1 (abc) peptide, which forms one aspect of 
the invention, has the amino acid sequence which is 
presented as SEQ ID NO: 10. The DNA coding sequence 
corresponding to the insert in the 409-1-1 clone is given 
in Figure 5 and is presented as SEQ ID NO: 9. 

The 409-1-1 (c-a) peptide, which forms another aspect
of the invention, has the amino acid sequence presented as
SEQ ID NO: 8. The DNA coding sequence corresponding to the
insert in the 409-1-1 (c-a) clone is given in Figure 6 and
is presented as SEQ ID NO: 7. The relationship between the
coding sequences of 409-1-1 (c-a) and 409-1-1 (abc) is
outlined in Example 12. Briefly, 409-1-1 (c-a) consists of
a carboxy terminal region of 409-1-1 (abc) moved to the
amino terminus of the 409-1-1 coding sequence, with a
truncation of the remaining 3' 409-1-1 (abc) coding
sequence.

More generally, the invention includes a peptide
antigen which is immunoreactive with sera from humans with
HCV infection. Such peptide antigens are readily
identifiable by the methods of the present invention.

Antigens obtained from the region corresponding to the
HCV sequences encoding the 409-1-1 antigens were further
characterized as follows. The primers shown in Table 7 were
used to generate a family of overlapping amplified fragments
derived from this region. Several templates were used for
the DNA amplification reactions (Table 8). The relationships
of the coding sequences of the resulting clones to
each other are graphically illustrated in Figure 7. The
amplified fragments were then cloned into lambda gt11
vectors (Example 13).

These cloned fragments were then immunoscreened
(Example 13). Seven of the nine clones tested positive by
preliminary immunoscreening (Table 9). These seven clones
were then tested against a more extensive battery of
PT-NANBH serum samples, including numerous human clinical
samples. The sensitivity of the antigens, in decreasing
order, for reactivity with the serum used for screening was
as follows: 33cu > 33c > 409-1-1 (c-a) > 409-1-1-F1R2 >
409-1-1 (abc) = 409-1-1a > 5-1-1 > 409-1-1 (c+270). As can
be seen from these results, all of the alternative clones,
with the exception of 409-1-1 (c+270), provided a more
sensitive antigen than 5-1-1. However, although 33cu and
33c were very sensitive antigens, in this assay they
reacted slightly with serum which was known to be negative
for HCV and may therefore be less specific. Accordingly,
the 409-1-1 series appears preferable for use as diagnostic
antigens since they are more specific to HCV-induced
antibodies.

The immunoscreening was extended to include the clone
36 and 45 encoded epitopes: the insert of clone 45 is
essentially the same as the insert of clone 40 (Example 4).
As can be seen from the results presented in Table 11, the
antigens produced by clones 36 and 40, while not as
sensitive as 409-1-1 (c-a), do yield HCV-specific
immunopositive signals with selected samples. Accordingly, the
two methods presented in the present invention, (i)
immunoscreening of cDNA libraries generated directly from
sera-derived RNA, and (ii) immunoscreening of amplified-fragment
libraries, can both be seen to be effective methods of
identifying cDNA sequences encoding viral antigens.
Further, confirmation of the clone 36 and 40 encoded
antigens by identification of antigens corresponding to
these HCV regions using the amplified-fragment library
method validates the usefulness of the amplified-fragment
method.

B. Peptide Purification 

The recombinant peptides of the present invention can
be purified by standard protein purification procedures
which may include differential precipitation, molecular
sieve chromatography, ion-exchange chromatography,
isoelectric focusing, gel electrophoresis and affinity
chromatography. In the case of a fused protein, such as the
beta-galactosidase fused proteins prepared as above, the fused
protein can be isolated readily by affinity chromatography,
by passing cell lysis material over a solid support having
surface-bound anti-beta-galactosidase antibody. For
example, purification of a beta-galactosidase/fusion
protein, derived from 409-1-1 (c-a) coding sequences, by
affinity chromatography is described in Example 14.

A fused protein containing the 409-1-1 (a) peptide
fused with glutathione-S-transferase (Sj26) protein has
also been expressed using the pGEX vector system in E. coli
KM392 cells (Smith). This expression system has the
advantage that the fused protein is generally soluble and
therefore can be isolated under non-denaturing conditions.
The fused Sj26 protein can be isolated readily by
glutathione substrate affinity chromatography (Smith). This
method of expressing this fusion protein is given in Example 15
and is applicable to any of the other antigen coding
sequences described by the present invention.

Also included in the invention is an expression
vector, such as the lambda gt11 or pGEX vectors described
above, containing the 409-1-1 (a) coding sequence and
expression control elements which allow expression of the
coding region in a suitable host. The coding sequence is
contained in the sequence given above corresponding to
basepairs 2755-3331 of the HCV genome. The control
elements generally include a promoter, translation
initiation codon, and translation and transcription termination
sequences, and an insertion site for introducing the insert
into the vector. In the case of the two vectors
illustrated in Example 15, the control elements control the
synthesis of the protein which is fused with the
heterologous peptide antigen. Such expression vectors can be
readily constructed for the other antigen coding sequences
described by the present invention.

The lambda gt11 vectors containing the following
coding regions have been deposited with The American Type
Culture Collection, 12301 Parklawn Dr., Rockville MD,
20852: the 409-1-1 (abc) coding region, designated
gt11/409-1-1(abc), ATCC No. 40876; the 409-1-1 (c-a) coding
region, designated gt11/409-1-1(c-a), ATCC No. 40792; clone
36, designated gt11/36, ATCC No. 40901; and clone 40,
designated gt11/40, ATCC No. 40893.

VII. Immunoreactive Clones of the HCV-Capsid Antigen

At the 1990 Congress of Hepatology a region of the
full-length HCV nucleic acid sequence was presented,
nucleotide residues 325-970, containing the HCV non-coding,
structural core protein and envelope protein coding
sequences as capsid parts of a polyprotein sequence. During the
course of experiments performed in support of the present
invention, the coding region that corresponds to the capsid
protein was more clearly defined.

Polymerase Chain Reaction primers were constructed
from selected HCV sequence which would generate
amplification products of nucleotides 325-970 of the full length HCV
genome (see Appendix). These primers, SF2(C) and SR1(C),
are presented in Example 16. The primers contained
non-complementary sequences which encoded restriction enzyme
cleavage sites to facilitate subsequent cloning
manipulations. The primers were used in amplification reactions
containing SISPA-amplified HCV cDNA molecules (Example 7)
as substrate. The resulting amplification products were
cloned into the pGEX and pET vectors (Example 16). The
pGEX vector allows expression of inserted coding sequences
as fusion proteins to the Sj26 protein,
glutathione-S-transferase. Insertion into the pET vector allows
expression of the inserted coding sequences independent of fusion
sequences.
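The idea of a primer carrying a non-complementary restriction-site tail can be sketched as below. This is an illustration under stated assumptions: the template is a made-up sequence, the EcoRI site (GAATTC) and the two-base clamp are arbitrary choices, and none of this reproduces the SF2(C) or SR1(C) primers of Example 16.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

def tailed_primer(annealing_seq, site, clamp="GC"):
    """Prepend a clamp and a restriction site (the non-complementary tail)
    to the part of the primer that anneals to the template."""
    return clamp + site + annealing_seq

def amplicon_length(first_nt, last_nt):
    """Length of a product spanning first_nt..last_nt inclusive."""
    return last_nt - first_nt + 1

# Hypothetical template and site, for illustration only.
template = "ATGGCTTCAACGGATCTGAAACCTTGGCAGTACGTTCCAGGATTA"
ECORI_SITE = "GAATTC"

forward = tailed_primer(template[:18], ECORI_SITE)
reverse = tailed_primer(reverse_complement(template[-18:]), ECORI_SITE)
print(forward)
print(reverse)
print(amplicon_length(325, 970))
```

The target region, nucleotides 325-970, is 646 basepairs long before the primer tails are added.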

These clones were then immunologically screened using
sera known to be reactive with HCV-antigens (Example 17).
Several clones in both vectors were identified which were
immunoreactive with the anti-HCV sera (in pGEX, clones 14,
15, 56, 60, and 65, Example 17, Table 13). It was observed
that the fusion proteins which were produced from the
clones in pGEX were smaller than expected.

Clone 15 was selected for scaled up production of the
Sj26/HCV-antigen fusion protein. The fusion protein
product (approximately 29 kd) was smaller than the expected
fusion product (approximately 50 kd, Example 17). Further,
the yield of the fusion protein from this preparation was
unexpectedly low.

Clones 15 and 56 were chosen for nucleic acid
sequencing of the HCV-antigen containing inserts (Example 18).
The sequences of the two clones were very similar with the
exception that clone 15 had a termination codon starting at
nucleotide position 126. This result suggested that the
amino terminal 42 amino acids encoded by the HCV insert
were immunogenic in regard to the anti-HCV sera used for
immunoscreening.
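The fusion-protein sizes reported above can be checked with a back-of-envelope calculation. Two values below are assumptions that do not come from the text: an average residue mass of about 110 daltons (a common rule of thumb) and a mass of about 26 kd for the Sj26 carrier.

```python
AVG_RESIDUE_DA = 110   # assumed rule-of-thumb average residue mass
SJ26_DA = 26_000       # assumed approximate mass of the Sj26 carrier

def fusion_mass_kd(n_residues):
    """Approximate mass in kd of Sj26 fused to n_residues of insert."""
    return (SJ26_DA + n_residues * AVG_RESIDUE_DA) / 1000

insert_nt = 970 - 325 + 1            # nucleotides 325-970 inclusive
full_length_residues = insert_nt // 3

print(insert_nt, full_length_residues)
print(fusion_mass_kd(full_length_residues))  # full-length insert
print(fusion_mass_kd(42))                    # insert truncated after 42 residues
```

Under these assumptions the full-length fusion comes to roughly 50 kd and the truncated fusion to roughly 31 kd, which is in the same range as the approximately 50 kd expected and approximately 29 kd observed products.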

To test the suggestion that the amino terminus of the
HCV polyprotein was antigenic, a synthetic oligopeptide was
constructed essentially corresponding to amino acid
residues 6-24 of Figure 8A: this peptide had very strong
immunoreactivity with anti-HCV sera as tested by ELISA.
PCR primers (Figure 8, C1 and NC105) were designed to
generate a clone corresponding to this region (Figure 10,
C1NC105, SEQ ID NO: 25). Three other synthetic peptides
were tested, one of which was strongly immunoreactive with
anti-HCV sera (amino acid residues 47-74, Figure 8A) and
two of which were weakly immunoreactive (amino acid residues
39-60 and 101-121, Figure 8A). These synthetic peptides
confirm the presence of a strong antigenic region at the
amino-terminal end of the HCV polyprotein in the capsid
protein region.

The sequence of clone 56, designated pGEX-GG1-56, is
shown in Figure 8A and is presented in the sequence listing
as SEQ ID NO: 11. The sequence shows that the clone has a
long, open reading frame. When production of the fusion
protein was induced, a fusion protein smaller than the
expected product was produced, similar in size to the clone
15 product. The nucleotide sequence of the clones revealed
a region which is prone to translational frameshifting,
AAAAAAAAAA (Atkins et al., Wilson et al.). Such a
nucleotide sequence may contribute to the low protein yields when
these clones are expressed in E. coli. In an effort to
improve the level of fusion protein expression, the third
nucleotide position of several codons through this region
was changed to a G, resulting in the sequence AGAAGAAGAA
(Example 20): the changes had no effect on the protein
coding sequence (amino acid residues 8-10, Figure 8A).
This modified insert was cloned into the pGEX vector and
the resulting plasmid named pGEX-CapA.
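That these third-position changes leave the encoded amino acids unchanged can be checked with a short sketch. The reading frame is inferred from the statement that the altered bases are third codon positions, which places one base of the first codon just upstream of the ten-base window; because that upstream base is not given here, the check is run for all four possibilities. The codon table lists only the codons needed.

```python
# Standard genetic code entries for the codons that can occur here.
CODON_TO_AA = {
    "AAA": "K", "AAG": "K",   # lysine
    "CAA": "Q", "CAG": "Q",   # glutamine
    "GAA": "E", "GAG": "E",   # glutamate
    "TAA": "*", "TAG": "*",   # stop
}

original = "AAAAAAAAAA"
modified = "AGAAGAAGAA"

# Window positions (0-based) at which the two sequences differ.
changed = [i for i, (a, b) in enumerate(zip(original, modified)) if a != b]

def translate(seq):
    """Translate complete codons of seq, reading from its first base."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, usable, 3))

def silent_for_any_upstream_base():
    """True if the change is synonymous whatever base precedes the window."""
    for base in "ACGT":
        before = translate((base + original)[:9])
        after = translate((base + modified)[:9])
        if before != after:
            return False
    return True

print(changed)
print(silent_for_any_upstream_base())
```

In this frame each A-to-G change falls on a codon of the form NAA, and NAA and NAG are synonymous for every first base N, so the three affected codons keep their meaning.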

A hydropathicity plot was generated for the protein
coding sequences of the insert of pGEX-GG1 (Example 19,
Figure 9). The results of this analysis indicated that the
carboxy-terminal region of the encoded protein,
approximately amino acid residues 168-182, had the potential for
being a membrane spanning segment. Since it was unlikely
that the membrane spanning segment would provide a strong
antigen and since overproduction of proteins with these
regions can adversely affect the growth of bacterial cells,
a series of carboxy terminal deletions were generated from
pGEX-CapA (Example 20).
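A hydropathicity profile of this general kind can be sketched with a sliding-window average. The text does not state which scale or window was used in Example 19; the Kyte-Doolittle scale, the window of 7 and the threshold of 1.6 below are assumptions, and the test sequence is a made-up peptide rather than the capsid sequence.

```python
# Kyte-Doolittle hydropathy values (the choice of scale is an assumption).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(seq, window=7):
    """Return (centre residue number, mean hydropathy) for each window."""
    half = window // 2
    return [
        (i + half + 1, sum(KD[aa] for aa in seq[i:i + window]) / window)
        for i in range(len(seq) - window + 1)
    ]

def hydrophobic_positions(profile, threshold=1.6):
    """Centre positions whose window average exceeds the threshold."""
    return [pos for pos, score in profile if score > threshold]

# Hypothetical peptide: charged flanks around a hydrophobic core.
toy_peptide = "KRDENQKRDE" + "LIVLAVILLV" + "KRDENQKRDE"

profile = hydropathy_profile(toy_peptide)
print(hydrophobic_positions(profile))
```

A run of consecutive high-scoring positions, as in the core of the toy peptide, is the kind of feature that would flag a candidate membrane spanning segment.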

To generate the carboxy terminal deletions, PCR primers
were designed to be complementary to various regions of the
sequence encoding the pGEX-CapA insert protein. The primers
used to generate the carboxy terminal deletions are given in
Table 14 and the location of the primers relative to the insert
coding sequence is presented in Figure 8B. The carboxy
terminal deletion fragments were cloned into the pGEX
vector and Sj26/HCV-insert fusion proteins were produced.
These fusion proteins were then screened with anti-HCV sera
and an epitope map generated for the immunoreactive
polypeptides (see Figure 10). Clones C1NC270, C1NC360, and
C1NC450 all expressed high levels of the Sj26/HCV fusion
proteins. Further, these fusion proteins all corresponded
to the size predicted from their nucleic acid coding
sequences. Clones C1NC520 and C1NC580 gave poor yields of
fusion proteins, suggesting that when the hydrophobic region
of amino acid residues 168-182 is present it may in part be
responsible for the poor protein yields previously
obtained.

The deletion analysis was continued to further dissect
the antigenic regions of the pGEX-CapA encoded HCV antigen.
A series of amino terminal deletions (primers in Table 15)
combined with carboxy terminal deletions were generated
using PCR primers: the locations of all the primers are
illustrated in Figure 8B.

The results of the deletion analysis are presented in
Table 16 and in Figure 10. These results, combined with
the synthetic peptide data presented above, suggest that
the capsid protein (which comprises the N-terminus of the
HCV polyprotein) has two dominant immunoreactive regions.
Both of these immunoreactive regions are useful as
diagnostic antigens. The region comprising the first 35
amino acids spans one of the epitopes and the region
spanning residues 34-90 encompasses the other strongly
immunoreactive domain.

In summary, all of the pGEX clones containing the
N-terminus of the HCV polyprotein and either 34, 90, 120 or
150 residues produced large quantities of fusion protein
which was shown to be efficiently recognized by HCV
positive sera. The expression product of the PCR inserts
containing amino acid residues 34-90 was also strongly
immunoreactive, whereas inserts encoding residues 90-120 or
90-150 were not immunoreactive, demonstrating that these
regions were not recognized by human sera. This result
suggests that the regions important for the production of
recombinant antigens are contained between residues 1
through 90.
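The reasoning of the deletion analysis can be sketched as a small interval calculation. The fragment boundaries are taken from the summary above, reading "the N-terminus and either 34, 90, 120 or 150 residues" as residues 1-34, 1-90, 1-120 and 1-150, which is an interpretive assumption; the function names are introduced here for illustration.

```python
# (first residue, last residue, immunoreactive?) as summarized in the text.
fragments = [
    (1, 34, True), (1, 90, True), (1, 120, True), (1, 150, True),
    (34, 90, True),
    (90, 120, False), (90, 150, False),
]

def minimal_positive(frags):
    """Positive fragments that do not contain a smaller positive fragment."""
    pos = [(s, e) for s, e, reactive in frags if reactive]
    return [
        f for f in pos
        if not any(g != f and f[0] <= g[0] and g[1] <= f[1] for g in pos)
    ]

def covering_span(intervals):
    """Smallest single interval that covers all the given intervals."""
    return (min(s for s, _ in intervals), max(e for _, e in intervals))

minimal = minimal_positive(fragments)
print(minimal)
print(covering_span(minimal))
```

The two minimal positive fragments correspond to the two immunoreactive regions, and the span covering them matches the stated conclusion that the important regions lie within residues 1 through 90.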

Analyses of the pGEXC1NC450 protein and the pET360
protein showed that the inclusion of these antigens in
Western and ELISA formats permitted the identification of
HCV positive sera which had been previously identified as
either HCV negative or HCV indeterminate. Accordingly, the
inclusion of these epitopes permits the generation of an
improved screening system (Example 21).

VIII. Anti-HCV Antigen Antibodies

In another aspect, the invention includes antibodies
specific against the recombinant antigens of the present
invention. Typically, to prepare antibodies, a host
animal, such as a rabbit, is immunized with the purified
antigen or fused protein antigen. The host serum or plasma
is collected following an appropriate time interval, and
this serum is tested for antibodies specific against the
antigen. Example 15 describes the production of rabbit
serum antibodies which are specific against the 409-1-1
antigens in the Sj26/409-1-1(a) and
beta-galactosidase/409-1-1 (c-a) fusion proteins. These techniques are
equally applicable to the other antigens of the present invention.
The gamma globulin fraction or the IgG antibodies of
immunized animals can be obtained, for example, by use of
saturated ammonium sulfate or DEAE Sephadex, or other
techniques known to those skilled in the art for producing
polyclonal antibodies.

Alternatively, the purified antigen or fused antigen
protein may be used for producing monoclonal antibodies.
Here the spleen or lymphocytes from an immunized animal are
removed and immortalized or used to prepare hybridomas by
methods known to those skilled in the art. To produce a
human-human hybridoma, a human lymphocyte donor is
selected. A donor known to be infected with an HCV virus (where
infection has been shown, for example, by the presence of
anti-virus antibodies in the blood) may serve as a suitable
lymphocyte donor. Lymphocytes can be isolated from a
peripheral blood sample, or spleen cells may be used if the
donor is subject to splenectomy. Epstein-Barr virus (EBV)
can be used to immortalize human lymphocytes, or a human
fusion partner can be used to produce human-human
hybridomas. Primary in vitro immunization with peptides can also
be used in the generation of human monoclonal antibodies.

Antibodies secreted by the immortalized cells are
screened to determine the clones that secrete antibodies of
the desired specificity, for example, using the Western
blot method described in Example 15.

IX. Utility

A. Diagnostic Method and Kit 

The antigens obtained by the methods of the present
invention are advantageous for use as diagnostic agents for
anti-HCV antibodies present in HCV-infected sera;
particularly, the 409-1-1 antigens (409-1-1 (abc), 409-1-1 (c-a),
and related antigens (see Table 9)); the clone 36 antigen;
the clone 40 antigen; and the capsid antigen. As noted
above, many of the antigens provide the advantage over
known HCV antigen reagents 5-1-1 and C-100 in that they are
immunoreactive with a wider range of PT-NANBH infected
sera, particularly acute-infection sera. This is
particularly true of combinations of the 409-1-1 antigens with the
HCV-core protein antigens as described in Section VII
above. The antigens 409-1-1 (c-a) and Cap450 have been
combined in an ELISA test kit and tested against HCV test
kits produced by Abbott and Ortho. The antigens of the
present invention consistently identify more HCV+ samples
with a high degree of specificity which is comparable to or
better than the Abbott and Ortho test kits.

In one preferred diagnostic configuration, test serum
is reacted with a solid phase reagent having a
surface-bound HCV antigen (or antigens) obtained by the methods of
the present invention, e.g., the 409-1-1 (c-a) antigen and
the Cap450 antigen. After binding anti-HCV antibody to the
reagent and removing unbound serum components by washing,
the reagent is reacted with reporter-labeled anti-human
antibody to bind reporter to the reagent in proportion to
the amount of bound anti-PT-NANBH antibody on the solid
support. The reagent is again washed to remove unbound
labeled antibody, and the amount of reporter associated
with the reagent is determined. Typically, the reporter is
an enzyme which is detected by incubating the solid phase
in the presence of a suitable fluorometric or colorimetric
substrate.

The solid surface reagent in the above assay is
prepared by known techniques for attaching protein material
to solid support material, such as polymeric beads,
dipsticks, 96-well plates or filter material. These attachment
methods generally include non-specific adsorption of the
protein to the support or covalent attachment of the
protein, typically through a free amine group, to a
chemically reactive group on the solid support, such as an
activated carboxyl, hydroxyl, or aldehyde group.

In a second diagnostic configuration, known as a
homogeneous assay, antibody binding to a solid support
produces some change in the reaction medium which can be
directly detected in the medium. Known general types of
homogeneous assays proposed heretofore include (a)
spin-labeled reporters, where antibody binding to the antigen is
detected by a change in reporter mobility (broadening of
the spin splitting peaks), (b) fluorescent reporters, where
binding is detected by a change in fluorescence efficiency,
(c) enzyme reporters, where antibody binding affects
enzyme/substrate interactions, and (d) liposome-bound
reporters, where binding leads to liposome lysis and
release of encapsulated reporter. The adaptation of these
methods to the protein antigens of the present invention
follows conventional methods for preparing homogeneous
assay reagents.

In each of the assays described above, the assay
method involves reacting the serum from a test individual
with the protein antigen and examining the antigen for the
presence of bound antibody. The examination may involve
attaching a labeled anti-human antibody to the antibody
being examined, either IgM (acute phase) or IgG
(convalescent or chronic phase), and measuring the amount of
reporter bound to the solid support, as in the first
method, or may involve observing the effect of antibody
binding on a homogeneous assay reagent, as in the second
method.

Also forming part of the invention is an assay system
or kit for carrying out the assay method just described.
The kit generally includes a support with surface-bound
recombinant HCV antigen (e.g., the 409-1-1 antigens, etc.,
as above), and a reporter-labeled anti-human antibody for
detecting surface-bound anti-PT-NANBH-antigen antibody.

As discussed in Section III above, peptide antigens
associated with several of the linking-fragment libraries
are immunoreactive with acute NANBH sera from chimpanzees,
indicating that the peptides would be useful for detecting
acute NANBH infection in human serum. In particular, one
or more peptide antigens produced by the linking fragment
libraries, F 8 /R, (reactive with chronic sera), F 3 I^, f 6 b 12/
FnR?* F?Rs/ or F 7 R 8 (which are shown in Example 11 to produce
one or more peptide antigens which are immunoreactive with
acute chimpanzee sera) can be combined with the 409-1-1
antigens to provide a diagnostic composition capable of
immunoreacting with a high percentage of both chronic and
acute human NANBH serum samples. Further, as discussed in
Section VII above, inclusion of the HCV-capsid protein
antigens of the present invention adds an extra level of
sensitivity.

A third diagnostic configuration involves use of the
anti-HCV antibodies, described in Section VI above, capable
of detecting HCV specific antigens. The HCV antigens may
be detected, for example, using an antigen capture assay
where HCV antigens present in candidate serum samples are
reacted with an HCV specific monoclonal antibody. The
monoclonal antibody is bound to a solid substrate and the
antigen is then detected by a second, different labelled
anti-HCV antibody. The monoclonal antibodies of the
present invention which are directed against HCV specific
antigens are particularly suited to this diagnostic method.

B. Peptide Vaccine

The HCV antigens identified by the methods of the
present invention, e.g., 409-1-1 (c-a) and HCV-core protein
antigens, can be formulated for use in an HCV vaccine. The
vaccine can be formulated by standard methods, for example,
in a suitable diluent such as water, saline, buffered
salines, complete or incomplete adjuvants, and the like.
The immunogen is administered using standard techniques for
antibody induction, such as by subcutaneous administration
of physiologically compatible, sterile solutions containing
inactivated or attenuated virus particles or antigens. An
immune-response-producing amount of virus particles is
typically administered per vaccinating injection, typically
in a volume of one milliliter or less.

A specific example of a vaccine composition includes,
in a pharmacologically acceptable adjuvant, a recombinant
409-1-1 (c-a) peptide. The vaccine is administered at
periodic intervals until a significant titer of anti-HCV
antibody is detected in the serum. Such vaccines can also
comprise combinations of the HCV antigens of the present
invention.

C. Passive Immunoprophylaxis

The anti-HCV antibodies of the invention can be used
as a means of enhancing an anti-HCV immune response since
antibody-virus complexes are recognized by macrophages and
other effector cells. The antibodies can be administered in
amounts similar to those used for other therapeutic
administrations of antibody. For example, pooled gamma
globulin is administered at 0.02-0.1 ml/lb body weight
during the early incubation of other viral diseases such as
rabies, measles and hepatitis B to interfere with viral
entry into cells. Thus, antibodies reactive with, for
example, the 409-1-1 (c-a) antigen can be passively
administered alone, in a "cocktail" with other anti-viral
antibodies, or in conjunction with another anti-viral agent
to a host infected with a PT-NANBH virus to enhance the immune
response and/or the effectiveness of an antiviral drug.

The following examples illustrate various aspects of
the invention, but are in no way intended to limit the
scope thereof.

Materials 

E. coli DNA polymerase I (Klenow fragment) was
obtained from Boehringer Mannheim Biochemicals
(Indianapolis, IN). T4 DNA ligase and T4 DNA polymerase were
obtained from New England Biolabs (Beverly, MA).
Nitrocellulose filters were obtained from Schleicher and Schuell
(Keene, NH).

Synthetic oligonucleotide linkers and primers were
prepared using commercially available automated
oligonucleotide synthesizers. Alternatively, custom designed
synthetic oligonucleotides may be purchased, for example,
from Synthetic Genetics (San Diego, CA). cDNA synthesis
kits and random priming labeling kits were obtained from
Boehringer-Mannheim Biochemical (BMB, Indianapolis, IN).

Example 1

Construction of NANB-containing cDNA libraries

A. Infection of a Chimpanzee with HCV 

A chimpanzee (#771) was inoculated with a Factor VIII
preparation which was known to cause parenterally
transmitted non-A, non-B hepatitis (PT-NANBH) in human patients
treated with the Factor VIII concentrate (Bradley).
Post-infection ultrastructural changes in liver tissue were
observed by electron microscopy and ALT (alanine
aminotransferase) elevation was observed in the infected
chimpanzee. These observations are consistent with
PT-NANBH infection.

B. Isolation of RNA from Sera 

Serum was collected from the above described infected
chimpanzee (#771) and four human PT-NANBH clinical sources
(EGM, BV, CC and WEH). Ten milliliters of each undiluted
serum was pelleted by centrifugation at 30K, for 3 hours in
an SW40 rotor, at 4°C. RNA was extracted from each
resulting serum pellet using the following modifications of the
hot phenol method of Feramisco et al. Briefly, for each
individual serum sample, the pellet was resuspended in 0.5
ml of 50 mM NaOAc, pH=4.8, containing 1% SDS. An equal
volume of 60°C phenol was added and incubated for 15 minutes
at 60°C with occasional vortexing. This mixture was
transferred to a 1.5 ml microfuge tube and spun for two
minutes at room temperature in a table top microfuge. The
aqueous phase was transferred to a new microfuge tube. To
the aqueous phase, 50 µl of 3 M NaOAc, pH=5.2, and two
volumes of 100% ethanol were added. This solution was held
at -70°C for approximately 10 minutes and then spun in a
microfuge at 4°C for 10 minutes. The resulting pellet was
resuspended in 100 µl of sterile glass distilled water. To
this solution 10 µl of NaOAc, pH=5.2, and two volumes of
100% ethanol were added. The solution was held at -70°C for
at least 10 minutes. The RNA pellet was recovered by
centrifugation in a microfuge at 12,000 x g for 15 minutes
at 5°C. The pellet was washed in 70% ethanol and dried
under vacuum.

C. Synthesis of cDNA 

(i) First Strand Synthesis 

The synthesis of cDNA molecules was accomplished as
follows. The above described RNA preparations were each
resuspended in 26 µl of sterile glass distilled water
(treated with diethyl pyrocarbonate, Maniatis et al.), 5 µl
of 10 X reaction buffer (0.5 M Tris HCl, pH=8.5; 0.4 M KCl;
0.1 M MgCl2; 4 mM DTT), 10 µl of a nucleotide solution
(dGTP, dATP, dTTP, and dCTP, each at a concentration of 5
mM), 5 µl random primer, 0.25 µl of 32P-dCTP, 2 µl AMV
reverse transcriptase, and 2 µl of RNASIN (Promega), in a
total reaction volume of 50 µl. This mixture was incubated
for one hour at 42°C.
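
The dilution arithmetic implied by the first strand recipe
can be sketched as follows. This is only an illustrative
calculation, not part of the described method: the variable
and function names are hypothetical, the stock concentrations
are those stated for the 10 X reaction buffer and the
nucleotide solution, and final concentrations are computed
against the stated 50 µl total volume.

```python
# Volumes (µl) of each component of the first strand reaction, as listed in the text.
components_ul = {
    "RNA in DEPC-treated water": 26.0,
    "10 X reaction buffer": 5.0,
    "nucleotide solution": 10.0,
    "random primer": 5.0,
    "32P-dCTP": 0.25,
    "AMV reverse transcriptase": 2.0,
    "RNASIN": 2.0,
}

STATED_TOTAL_UL = 50.0  # total reaction volume stated in the text

# Stock concentrations (mM) stated for the 10 X buffer and the nucleotide solution.
buffer_10x_mM = {"Tris-HCl": 500.0, "KCl": 400.0, "MgCl2": 100.0, "DTT": 4.0}
DNTP_STOCK_MM = 5.0


def final_concentration(stock_mM, volume_added_ul, total_ul=STATED_TOTAL_UL):
    """Simple dilution: C_final = C_stock * V_added / V_total."""
    return stock_mM * volume_added_ul / total_ul


# Sum of the listed volumes (slightly above the nominal 50 µl because of the tracer).
summed_total_ul = sum(components_ul.values())

final_buffer_mM = {
    name: final_concentration(conc, components_ul["10 X reaction buffer"])
    for name, conc in buffer_10x_mM.items()
}
final_dntp_mM = final_concentration(DNTP_STOCK_MM, components_ul["nucleotide solution"])

print(f"sum of listed volumes: {summed_total_ul} µl")
for name, conc in final_buffer_mM.items():
    print(f"{name}: {conc} mM final")
print(f"each dNTP: {final_dntp_mM} mM final")
```

Under these assumptions the 10 X buffer is diluted to 1 X and
each nucleotide to one-fifth of its stock concentration; the
listed volumes sum to slightly more than the nominal total
because of the radiolabel tracer.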

(ii) Second Strand cDNA Synthesis 

To the first strand synthesis reaction mixture the
following components were added: 55 µl of 2 X second
strand synthesis buffer (50 mM Tris HCl, pH=7.0; 60 mM
KCl); 2 µl RNase H; 5 µl DNA polymerase I, and 2 µl of the
above described nucleotide solution. The reaction was
incubated for one hour at 12°C, followed by a one hour
incubation at room temperature. The reaction mixture was
extracted with an equal volume of 1:1 phenol/chloroform,
followed by an extraction using 24:1 chloroform/isoamyl
alcohol. To each reaction mixture 1 µl of 10 mg/ml tRNA
was added as carrier. The cDNA was precipitated by the
addition of two volumes of 100% ethanol and chilling at
-70°C for 15 minutes. The cDNA was collected by
centrifugation, the pellet washed with 70% ethanol and dried under
vacuum.

(iii) Preparation of the Double Stranded cDNA for
Cloning

To provide vector compatible ends each of the double
stranded cDNA preparations was tailed with EcoRI linkers in
the following manner.

The cDNA was treated with EcoRI methylase under the
following conditions: The cDNA pellet was resuspended in
20 µl 1 X methylase buffer (50 mM Tris HCl, pH=7.5; 1 mM
EDTA; 5 mM DTT), 2 µl 0.1 mM S-adenosyl-methionine (SAM) and
2 µl EcoRI methylase (New England Biolabs). The reaction
was incubated for 30 minutes at 37°C. TE buffer (10 mM
Tris-HCl, pH=7.5; 1 mM EDTA, pH=8.0) was added to achieve
a final volume of 80 µl. The reaction mixture was extracted
with an equal volume of phenol/chloroform (1:1) and then
with an equal volume of chloroform/isoamyl alcohol (24:1).
The cDNA was precipitated with two volumes of ethanol.

To maximize the number of blunt ends for the addition
of linkers (Maniatis et al., 1982) the cDNA was then treated
with the Klenow fragment of DNA polymerase I. The pelleted
cDNA was resuspended in 11.5 µl of distilled water. The
following components were added to the resuspended cDNA:
4 µl of 5 X NTB (10 X NTB stock solution: 0.5 M Tris.Cl
pH=7.2; 0.1 M MgSO4; 1 mM dithiothreitol (DTT); 500 µg/ml
bovine serum albumin (BSA)); 3 µl 0.1 M MgCl2, 1.5 µl 10GATC
(a solution containing 10 mM of each nucleotide G, A, T,
and C), and 1 µl Klenow (Boehringer Mannheim Biochemicals).
The reaction mixture was incubated at room temperature for
30 minutes. The reaction mixture was extracted with
phenol/chloroform and chloroform/isoamyl alcohol as
described above, and then precipitated with two volumes of
ethanol.

The cDNA pellet was resuspended in 12 µl distilled
water. To the resuspended cDNA the following components
were added: 5 µl EcoRI phosphorylated linkers (New England
Biolabs), 2 µl 10 X ligation buffer (0.66 M Tris.Cl pH=7.6,
50 mM MgCl2, 50 mM DTT, 10 mM ATP) and 1 µl T4 DNA ligase.
The reaction was incubated at 14°C overnight. The following
morning the reaction was incubated at 67°C for three
minutes to inactivate the ligase, then momentarily chilled.
To the ligation reaction mixture 2.5 µl of 10 X high salt
restriction digest buffer (Maniatis et al.) and 2.5 µl of
EcoRI enzyme were added and the mixture incubated at 37°C
for at least 6 hours to overnight. To remove excess
linkers the digestion mixture was loaded onto a 1.2%
agarose gel and the reaction components size fractionated
by electrophoresis. Size fractions of the 0.3-1.3 kb and
1.3-7 kb ranges were electroeluted onto NA45 paper
(Schleicher and Schuell). The NA45 paper, with the eluted
cDNA bound to it, was placed in a 1.5 ml microfuge tube
containing 0.5 ml of elution solution (50 mM arginine, 1 M
NaCl, pH=9.0). The tube was then placed at 67°C for
approximately one hour to allow the cDNA to be eluted from
the paper into the solution. The solution was then
phenol/chloroform, chloroform/isoamyl alcohol extracted and
precipitated with two volumes of ethanol. The resulting
cDNA pellets were resuspended in 20 µl TE (pH=7.5).

(iv) Cloning of the cDNA into Lambda Vectors

The linkers used in the construction of the cDNAs
contained an EcoRI site which allowed for direct insertion
of the amplified cDNAs into lambda gt10 and gt11 vectors
(Promega, Madison WI). Lambda vectors were purchased from
the manufacturer (Promega) already digested with
EcoRI and treated with bacterial alkaline phosphatase, to
remove the 5' phosphate and prevent self-ligation of the
vector.

The EcoRI-linkered cDNA preparations were ligated into
both lambda gt10 and gt11 (Promega). The conditions of the
ligation reactions were as follows: 1 µl vector DNA
(Promega, 0.5 mg/ml); 0.5 or 3 µl of insert cDNA; 0.5 µl 10
X ligation buffer (0.5 M Tris-HCl, pH=7.8; 0.1 M MgCl2; 0.2
M DTT; 10 mM ATP; 0.5 g/ml BSA), 0.5 µl T4 DNA ligase (New
England Biolabs) and distilled water to a final reaction
volume of 5 µl.

The ligation reaction tubes were placed at 14°C
overnight (12-18 hours). The ligated cDNA was packaged the
following morning by standard procedures using a lambda DNA
packaging system (GIGAPAK, Stratagene, La Jolla, CA), and
then plated at various dilutions to determine the titer and
recombinant frequency of the libraries. A standard X-gal
blue/white assay was used to screen the lambda gt11
libraries (Miller; Maniatis et al.). E. coli HG415 (from
Howard Gersenfeld, Dept. of Pathology, Stanford School of
Medicine) plating bacteria, which allow plaque
formation only by recombinant clones, were used for plating the
lambda gt10 libraries. The standard strain, E. coli C600hF-,
may be used as an alternative to E. coli HG415.

Example 2

Screening the cDNA library for production of PT-NANBH
antigens

The five lambda gt11 libraries generated in Example 1
were screened for specific HCV encoded viral antigens by
immunoscreening. The phage were plated for plaque
formation using the Escherichia coli bacterial plating strain
E. coli KM392 (Kevin Moore, DNAX, Palo Alto, CA).
Alternatively, E. coli Y1088 may be used. The fusion proteins
expressed by the lambda gt11 clones were screened with
serum antibodies (Young et al.) from the following sources:
chimpanzee #771 and various human PT-NANBH sera (including
EGM, BV, WEH and AG).

From the lambda gt11 libraries (Example 1)
approximately 111 independent clones gave a positive immunological
reaction with at least one of the chimp or human PT-NANBH
sera. These phage clones were plaque purified and the
recombinant phage grown for DNA purification (Maniatis et
al.).






Example 3

Genomic Hybridization Screening of Immunoreactive Clones

Out of the 111 plaque purified recombinant phage
obtained as in Example 2, 93 were isolated (Maniatis et
al.) and digested with EcoRI as per the manufacturer's
instructions (Bethesda Research Laboratories, Gaithersburg,
MD). Approximately 1.0 microgram of each digested phage
DNA sample was loaded into sample wells of 1.0% agarose
gels prepared using TAE (0.04 M Tris Acetate, 0.001 M
EDTA). The DNA samples were then electrophoretically
separated. DNA bands were visualized by ethidium bromide
staining (Maniatis et al.). Inserts were clearly
identified for each of the 93 clones, purified by electroelution
using NA45, and then radioactively labelled by nick
translation (Maniatis et al.).

Human peripheral blood lymphocyte (PBL) DNA was
restriction digested with HindIII and EcoRI, loaded on a
0.7% agarose gel (as above, except 10 µg of DNA was loaded
per lane) and the fragments separated electrophoretically.
The DNA fragments in the agarose gels were transferred to
nitrocellulose filters (Southern) and the genomic DNA
probed with the nick-translated lambda gt11 inserts which
were prepared above.

The filters were washed (Southern; Maniatis et al.)
and exposed to X-ray film. Forty-three of the 93 lambda
clone inserts displayed a positive hybridization reaction
with the human PBL DNA. Among the remaining inserts, which
clearly did not hybridize with the PBL DNA, were 11 inserts
derived from chimp #771 clones which were also clearly
immunopositive from Example 2. Of these 11 clones, two
had the immunoreactive characteristics summarized
in Table 1. Chimpanzee #771 and humans AG, BV and
WEH were chronic PT-NANBH sera samples and SKF was a
normal human serum sample.

Table 1

Clone Designation
36 40

+ +
+
+

Sera

#771
AG
BV
WEH
SKF

Clone 40 (original clone screening designation 304-12-1)
was clearly exogenous, i.e., not derived from normal human
DNA, as evidenced by repeated hybridization tests against
normal human PBL DNA, and a second clone, designated clone
36 (original clone screening designation 303-1-4), was not
only exogenous but also reactive with multiple PT-NANBH
antisera.

Example 4

Sequencing of clones

DNA sequencing was performed on clones 36 and 40 as
described in Example 3. Commercially available sequencing
primers (New England Biolabs) homologous to flanking lambda
sequences at the 5' and 3' ends of the inserts were
initially used for sequencing. As sequencing progressed
primers were constructed to correspond to newly discovered
sequences. Synthetic oligonucleotide primers were prepared
using commercially available automated oligonucleotide
synthesizers. Alternatively, custom designed synthetic
oligonucleotides may be purchased, for example, from
Synthetic Genetics (San Diego, CA).

DNA sequences were determined for the complete insert
of clone 40 (presented as SEQ ID NO:1 and also shown in
Figure 3); this sequence corresponds to nucleotides 6516 to
7070 of the HCV genome (Appendix). Subsequently, the
inserts present in clones 44 and 45 (2 other clones of the
11 clones identified in Example 3) were found to
cross-hybridize to the clone 40 insert. Partial sequencing of
clones 44 and 45 showed that the sequences obtained from
these two clones matched the sequence of clone 40. A
partial sequence of the clone 36 insert was determined and is
presented as SEQ ID NO:3; the complete sequence is
presented as SEQ ID NO:5 and is also shown in Figure 4. The
sequence of clone 36 corresponds to nucleotides 5010 to
6515 given in the Appendix.
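
As an illustration only, the insert lengths implied by the
nucleotide coordinates quoted above can be computed as
follows. The names used are hypothetical, and the sketch
assumes the coordinates are inclusive, 1-based positions in
the Appendix numbering.

```python
# Coordinates (inclusive) quoted in the text for the two clone inserts.
clone_spans = {
    "clone 36": (5010, 6515),
    "clone 40": (6516, 7070),
}


def span_length(start, end):
    """Number of nucleotides in an inclusive coordinate range."""
    return end - start + 1


insert_lengths = {name: span_length(*span) for name, span in clone_spans.items()}

# The two inserts abut if clone 40 starts one position after clone 36 ends.
are_contiguous = clone_spans["clone 36"][1] + 1 == clone_spans["clone 40"][0]

# Length of the region covered by both inserts together.
combined_length = span_length(clone_spans["clone 36"][0], clone_spans["clone 40"][1])

for name, length in insert_lengths.items():
    print(f"{name}: {length} nt")
print(f"contiguous: {are_contiguous}, combined: {combined_length} nt")
```

On these assumptions the two inserts abut one another without
a gap or overlap in the Appendix numbering.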


Example 5 

Screening of the cDNA library in lambda gt10

The cDNA libraries in lambda gt10, generated in
Example 1, were screened for the presence of sequences
homologous to the clone 40 insert.

The lambda gt10 libraries were plated at a density of
approximately 10^4 plaques/plate and plaque lifts were
prepared according to Maniatis et al. Filters were indexed
using india ink to allow alignment of the filters with the
parent plate from which the plaque lift was performed. The
bacteria and phage particles were lysed, and the
nitrocellulose filters were processed and baked as previously
described (Maniatis et al.). The prehybridization
solution, per filter, consisted of the following: 5.4 ml
prehybridization buffer (50 ml of 1 M Tris HCl, pH=7.5; 2
ml of 0.5 M EDTA, pH=8.0; 50 ml of 10% SDS; 150 ml of 20 X
SSC (Maniatis et al.); and 238 ml of glass distilled
water); 6.0 ml formamide; 0.4 ml 50 X Denhardt's solution (5
g FICOLL; 5 g polyvinylpyrrolidone; 5 g bovine serum
albumin; brought to a total volume of 500 ml with glass
distilled water); and 0.2 ml of single-stranded salmon
sperm DNA (10 mg/ml). Each filter was placed in a plastic
bag and the prehybridization solution was added. The bag
was sealed and incubated at 37°C overnight with intermittent
mixing of contents.

The clone 40 lambda DNA was isolated (Maniatis et al.)
and digested with EcoRI. The resulting fragments were
fractionated on an agarose gel and visualized by ethidium
bromide staining (Maniatis et al.). The DNA fragment
corresponding to the clone 40 insert, approximately 500
base pairs, was isolated from the agarose by electroelution
onto NA45. The aqueous suspension of the purified fragment
was extracted once with a 1:1 phenol/chloroform solution,
and once with a 24:1 chloroform/isoamyl alcohol solution.
The DNA was then precipitated with ethanol and resuspended
in sterile water.

The clone 40 insert was radioactively labelled by nick
translation and used to probe the lambda gt10 plaque lift
filters. The prehybridization solution was removed from
the filters. Each filter was hybridized with probe under
the following conditions: 5.0 ml of hybridization buffer
(5 ml of 1 M Tris HCl, pH=7.5; 0.2 ml of 0.5 M EDTA,
pH=8.0; 5.0 ml of 10% SDS; 14.9 ml of 20 X SSC (Maniatis et
al.); 10 g of dextran sulfate; and glass distilled water
to a total volume of 50 ml); 5.0 ml formamide; 0.4 ml 50 X
Denhardt's solution (5 g FICOLL; 5 g polyvinylpyrrolidone;
5 g bovine serum albumin; brought to a total volume of 500
ml with glass distilled water); and 0.2 ml of
single-stranded salmon sperm DNA (10 mg/ml). To this hybridization
mix was added 50-250 µl of denatured probe (boiled 5-10
minutes and quick-chilled on ice), resulting in approximately
10^6 cpm of labelled probe per filter. The hybridization
mix containing the labelled probe was then added to
the plastic bag containing the filters. The bag was
resealed and placed under a glass plate in a 37°C water bath
overnight with intermittent mixing of contents.
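
For orientation, the composition of the prehybridization and
hybridization mixes described in this Example can be tallied
as in the following sketch. It is illustrative only: the
names are hypothetical, the per-filter volumes are those
stated in the text, and the small probe volume (50-250 µl)
added to the hybridization mix is ignored.

```python
# Per-filter component volumes (ml) stated in the text.
prehybridization_ml = {
    "buffer": 5.4,
    "formamide": 6.0,
    "50 X Denhardt's": 0.4,
    "salmon sperm DNA (10 mg/ml)": 0.2,
}
hybridization_ml = {
    "buffer": 5.0,
    "formamide": 5.0,
    "50 X Denhardt's": 0.4,
    "salmon sperm DNA (10 mg/ml)": 0.2,
}


def summarize(mix):
    """Return total volume, formamide fraction (v/v), Denhardt's fold
    concentration and salmon sperm DNA concentration (mg/ml) for a mix."""
    total = sum(mix.values())
    return {
        "total_ml": total,
        "formamide_fraction": mix["formamide"] / total,
        "denhardts_x": 50.0 * mix["50 X Denhardt's"] / total,
        "carrier_dna_mg_per_ml": 10.0 * mix["salmon sperm DNA (10 mg/ml)"] / total,
    }


prehyb_summary = summarize(prehybridization_ml)
hyb_summary = summarize(hybridization_ml)

for label, summary in (("prehybridization", prehyb_summary), ("hybridization", hyb_summary)):
    print(
        f"{label}: {summary['total_ml']:.1f} ml total, "
        f"{100 * summary['formamide_fraction']:.1f}% formamide, "
        f"{summary['denhardts_x']:.2f} X Denhardt's, "
        f"{summary['carrier_dna_mg_per_ml']:.3f} mg/ml carrier DNA"
    )
```

With the stated volumes, formamide makes up about half of
each mix by volume, which is consistent with the 37°C
incubation temperature used for both steps.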

The next day the hybridization solution was removed
and the filters washed three times, for 5 minutes each, in
2 X SSC (Maniatis et al.) containing 0.5% SDS, at room
temperature. The filters were then washed for one hour in
2 X SSC, containing 0.5% SDS, at 50°C. The filters were
then washed for 15-60 minutes in 0.1 X SSC, containing 0.1%
SDS, at 50°C and finally in 2 X SSC for 15 minutes, 2-3
times, at room temperature. The washed filters were dried and then
exposed to X-ray film for detection of positive plaques.

Approximately 24 plaques from the lambda gt10
libraries were plaque purified from the approximately 200 plaques
which tested positive by the hybridization screen (Table
2).

Table 2

Library    cDNA Source    Positives/Plate

EGM        Human          ~50
BV         Human          ~100
WEH        Human          ~25
#771       Chimp          ~10-15


Example 6 

Analysis of lambda gt10 cDNA Library Clones Homologous
to the Clone 40 Insert

The clones identified in Example 5 which have homology
to the clone 40 insert were analyzed by standard
restriction analysis and the insert sizes were determined. The
original frequencies of positive hybridization signals per
plate using the clone 40 insert as probe against the
different cDNA sources are shown in the last column of
Table 2. That these positive signals arose with different
frequencies for the different cDNA sources in the lambda
gt10 library suggests that the hybridization signals
originated from the sera source rather than from common
contamination introduced during cDNA synthesis or cloning.

One of the clones (108-2-5) from the EGM-generated
cDNA library identified by hybridization with the clone 40
insert had an insert of approximately 3.7 kb and was
chosen for further analysis. The insert was isolated by
EcoRI digestion of the clone, electrophoretic
fractionation, and electroelution (Example 5). The insert was
treated with DNase I under conditions resulting in partial
digestion (Maniatis et al.) to generate random fragments.
The resulting fragments were inserted into lambda gt11
vectors for expression. The lambda gt11 clones were then
immunoscreened (Example 2) using human (BV and normal) and
chimpanzee #771 sera. Twelve positive clones were
identified by first round immunoscreening with the human and
chimp sera. Seven of the 12 clones were plaque purified
and rescreened using chimp serum (#771). Partial DNA
sequences of the insert DNA were determined for two of the
resulting clones that had the largest sequences, designated
328-16-1 and 328-16-2. The 2 clones had sequences
essentially identical to clone 40.

Example 7 

Preparing Amplified HCV cDNA Fragments

A. Preparing cDNA fragments 

A plasma pool from a chimpanzee with chronic
PT-NANBH was obtained from the Centers for Disease Control
(CDC) (Atlanta, GA). After direct pelleting or PEG
precipitation, RNA was extracted from the virions by
guanidinium thiocyanate-phenol-chloroform extraction,
according to published methods (Chomczynski). The pelleted
RNA was used for cDNA synthesis using oligo dT or random
primers, or HCV sequence-specific primers and a commercial
cDNA kit (Boehringer-Mannheim).

In one method, synthesis of first strand cDNA was
achieved by addition of four primers, designated A, B, C,
and D, having the sequences shown below. These sequences
are complementary to the HCV genomic regions indicated:

A: 5'-GCGGAAGCAATCAGTGGGGC-3', complementary to basepairs
394-413;

B: 5'-GCCGGTCATGAGGGCATCGG-3', complementary to basepairs
2960-2980;

C: 5'-CGAGGAGCTGGCCACAGAGG-3', complementary to basepairs
5239-5258; and

D: 5'-TGGTTCTATGGAGTAGCAGGCCCCG-3', complementary to
basepairs 7256-7280.
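
A simple consistency check on the primer listing is sketched
below: it compares the length of each primer with the size of
the genomic interval it is stated to complement, and derives
the genomic-sense sequence by reverse complementation. The
names are hypothetical, and the sketch assumes the basepair
ranges are inclusive; it does not consult the Appendix
sequence itself.

```python
# Primer sequences (5'->3') and the inclusive basepair ranges quoted in the text.
primers = {
    "A": ("GCGGAAGCAATCAGTGGGGC", 394, 413),
    "B": ("GCCGGTCATGAGGGCATCGG", 2960, 2980),
    "C": ("CGAGGAGCTGGCCACAGAGG", 5239, 5258),
    "D": ("TGGTTCTATGGAGTAGCAGGCCCCG", 7256, 7280),
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# Primers whose length differs from the size of the quoted interval.
length_mismatches = []
for name, (seq, start, end) in primers.items():
    interval = end - start + 1
    sense = reverse_complement(seq)  # expected genomic-sense sequence of the site
    print(f"{name}: {len(seq)} nt primer, {interval} bp interval, sense strand {sense}")
    if len(seq) != interval:
        length_mismatches.append(name)

print("length/interval mismatches:", length_mismatches)
```

Under the inclusive-range assumption, any primer flagged by
this check has a quoted interval that is one position longer
or shorter than the primer itself; whether that reflects the
source coordinates or a transcription slip cannot be decided
from the listing alone.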

Second strand cDNA synthesis was performed by the 
method of Gubler and Hoffman. The reactions were carried 
out under standard cDNA synthesis methods given in the 
commercial kit. 


B. Amplifying the cDNA Fragments 

The cDNA from above was blunt ended and ligated to the
linker/primer having the following sequence:

Linker/primer: 5'-GGA ATT CGC GGC CGC TCG-3'      A-strand
            3'-TT CCT TAA GCG CCG GCG AGC-5'      B-strand
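
The pairing of the two linker strands shown above can be
checked with a short sketch such as the following. It is
illustrative only; the names are hypothetical, and the
recognition sequences used for EcoRI (GAATTC) and NotI
(GCGGCCGC) are the standard ones rather than values taken
from this text.

```python
# A-strand written 5'->3' and B-strand written 3'->5', exactly as displayed above.
A_STRAND_5_TO_3 = "GGAATTCGCGGCCGCTCG"
B_STRAND_3_TO_5 = "TTCCTTAAGCGCCGGCGAGC"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq):
    """Base-by-base complement without reversal (antiparallel partner read 3'->5')."""
    return "".join(COMPLEMENT[base] for base in seq)


# The partner of the A-strand, read 3'->5', should align with the end of the B-strand.
a_partner_3_to_5 = complement(A_STRAND_5_TO_3)
strands_pair = B_STRAND_3_TO_5.endswith(a_partner_3_to_5)

# Any extra B-strand bases at its 3' end form a single-stranded overhang.
overhang_3_prime = B_STRAND_3_TO_5[: len(B_STRAND_3_TO_5) - len(A_STRAND_5_TO_3)]

# Standard restriction sites (assumed, not quoted from the text).
has_ecori_site = "GAATTC" in A_STRAND_5_TO_3
has_noti_site = "GCGGCCGC" in A_STRAND_5_TO_3

print("strands pair:", strands_pair)
print("B-strand 3' overhang:", overhang_3_prime)
print("EcoRI site present:", has_ecori_site, "| NotI site present:", has_noti_site)
```

On this reading the duplex is blunt at the end where the
A-strand 3' terminus lies and carries a two-base 3' extension
on the B-strand at the other end.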

The cDNA and linker were mixed at a 1:100 molar ratio in
the presence of 0.3 to 0.6 Weiss units of T4 DNA ligase.
To 100 µl of 10 mM Tris-Cl buffer, pH 8.3, containing 1.5
mM MgCl2 and 50 mM KCl (Buffer A) was added about 1 x 10^-3
µg of the linker-ended cDNA, 2 µM of linker/primer A
(A-strand) having the sequence d(5'-GGAATTCGCGGCCGCTCG-3'),
200 µM each of dATP, dCTP, dGTP, and dTTP, and 2.5 units of
Thermus aquaticus DNA polymerase (Taq polymerase). The
reaction mixture was heated to 94°C for 30 sec for
denaturation, allowed to cool to 50°C for 30 sec for primer
annealing, and then heated to 72°C for 0.5-3 minutes to
allow for primer extension by Taq polymerase. The
replication reaction, involving successive heating, cooling, and
polymerase reaction, was repeated an additional 25 times
with the aid of a Perkin-Elmer Cetus DNA thermal cycler.
This results in a pool of SISPA (sequence-independent
single primer amplification)-amplified DNA fragments.
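
The cycling program described above can be summarized
numerically as in the sketch below. It is a rough
illustration under idealized assumptions that are not stated
in the text: perfect doubling in every cycle and no time
spent ramping between temperatures. The names are
hypothetical.

```python
# Cycle counts stated in the text: one initial cycle plus 25 repeats.
INITIAL_CYCLES = 1
ADDITIONAL_CYCLES = 25
total_cycles = INITIAL_CYCLES + ADDITIONAL_CYCLES

# Idealized upper bound on amplification, assuming perfect doubling per cycle.
max_fold_amplification = 2 ** total_cycles

# Hold times per cycle in seconds: 94°C and 50°C steps, then 0.5-3 min at 72°C.
DENATURE_S = 30
ANNEAL_S = 30
EXTEND_S_RANGE = (30, 180)

# Total hold time (minutes) for the shortest and longest extension settings.
hold_minutes = tuple(
    total_cycles * (DENATURE_S + ANNEAL_S + extend_s) / 60
    for extend_s in EXTEND_S_RANGE
)

print(f"{total_cycles} cycles, at most {max_fold_amplification:,}-fold amplification")
print(f"hold time between {hold_minutes[0]:.0f} and {hold_minutes[1]:.0f} minutes")
```

Actual yields would be expected to fall well below the
idealized bound, since amplification efficiency drops as
reagents are consumed.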

Example 8 

Preparing Primer-Pair Fragments

Amplified cDNA fragments from Example 7 were mixed
with 100 µl Buffer A, 1 µM of equal molar amounts of one of
the primer pairs given below, 200 µM each of dATP, dCTP,
dGTP, and dTTP, and 2.5 units of Thermus aquaticus DNA
polymerase (Taq polymerase). Each primer pair includes a
forward (upstream) primer Fi which is identical to the
coding strand at the upstream end of an overlap region Pi of
duplex genomic DNA and a reverse primer Ri which is
complementary to the coding strand at the downstream end of
the region Pi. The sets of primers each define an overlap
region of about 200 basepairs, and the spacing between adjacent
overlapping primer regions (i.e., between adjacent
Fi/Ri pairs) is about 0.5-1 kilobase. The regions of HCV
which are complementary to the primers are given below:

F1, basepairs 183-201; R1, basepairs 361-380
F10, basepairs 576-595; R10, basepairs 841-860
F2, basepairs 1080-1100; R2, basepairs 1254-1273
F3, basepairs 1929-1948; R3, basepairs 2067-2086
F4, basepairs 2754-2733; R4, basepairs 2920-2940
F5, basepairs 3601-3620; R5, basepairs 3745-3764
F6, basepairs 4301-4320; R6, basepairs 4423-4442
F12, basepairs 4847-4865; R12, basepairs 4715-4734
F7, basepairs 5047-5066; R7, basepairs 5200-5216
F8, basepairs 5885-5904; R8, basepairs 6028-6047
F9, basepairs 6902-6921; R9, basepairs 7051-7070

Polymerase Chain Reaction (PCR) amplification of the
SISPA-amplified cDNA fragments with each Fi/Ri primer pair
was carried out under conditions similar to those used
above, with about 25 cycles.
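
The expected size of each primer-pair product can be
estimated from the listed coordinates, as in the following
sketch. It is illustrative only: the names are hypothetical,
the coordinates are entered exactly as they appear in the
listing, and the checks merely flag entries whose coordinates
look internally inconsistent without attempting to correct
them.

```python
# (forward start, forward end), (reverse start, reverse end), as listed in the text.
primer_table = {
    "1": ((183, 201), (361, 380)),
    "10": ((576, 595), (841, 860)),
    "2": ((1080, 1100), (1254, 1273)),
    "3": ((1929, 1948), (2067, 2086)),
    "4": ((2754, 2733), (2920, 2940)),
    "5": ((3601, 3620), (3745, 3764)),
    "6": ((4301, 4320), (4423, 4442)),
    "12": ((4847, 4865), (4715, 4734)),
    "7": ((5047, 5066), (5200, 5216)),
    "8": ((5885, 5904), (6028, 6047)),
    "9": ((6902, 6921), (7051, 7070)),
}

product_sizes = {}  # expected product size (bp) per primer pair
anomalies = []      # human-readable notes on suspicious entries

for label, (fwd, rev) in primer_table.items():
    if fwd[1] < fwd[0]:
        anomalies.append(f"F{label}: end coordinate precedes start coordinate")
    if rev[1] < rev[0]:
        anomalies.append(f"R{label}: end coordinate precedes start coordinate")
    if rev[0] < fwd[0]:
        # A reverse primer upstream of its forward primer cannot give a product.
        anomalies.append(f"pair {label}: reverse primer lies upstream of forward primer")
        continue
    # Product runs from the first base of the forward primer to the last of the reverse.
    product_sizes[label] = rev[1] - fwd[0] + 1

for label, size in product_sizes.items():
    print(f"F{label}/R{label}: about {size} bp")
for note in anomalies:
    print("check:", note)
```

The computed sizes fall roughly in the range expected for
overlap regions of about 200 basepairs; entries flagged by
the check would need to be verified against the Appendix
sequence.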

The amplified fragment mixtures from above were each
fractionated by electrophoresis on 1.5% agarose and
transferred to nitrocellulose filters (Southern).
Hybridization of the nitrocellulose-bound fragments, each with an
internal-sequence oligonucleotide probe, confirmed that each
fragment contained the expected sequences. Hybridization
was carried out with an internal oligonucleotide
radiolabeled by polynucleotide kinase, according to standard
methods.



Example 9 
Preparing Linking Fragments

This example describes preparing large overlapping
linking fragments of the HCV sequence. SISPA-amplified
cDNA fragments from Example 7 were mixed with 100 µl Buffer
A, 1 µM of equal molar amounts of forward and reverse
primers in each of the primer pairs given below, 200 µM each
of dATP, dCTP, dGTP, and dTTP, and 2.5 units of Thermus
aquaticus DNA polymerase (Taq polymerase), as in Example 8.
Each primer pair includes a forward primer Fi and a reverse
primer Rj, where Fi is the forward primer for one overlap
region Pi, and Rj is the reverse primer of the adjacent
overlap region. Thus each linking fragment spans two
adjacent overlap regions. The sets of primers each define
a linking fragment of about 0.5-1 kilobases. The sequences
of the primer pairs are given in Example 8. The
overlapping linking fragments of the HCV sequence (Appendix)
spanned by each primer pair are given below:

F1/R10, basepairs 183-860
F10/R2, basepairs 576-1273
F2/R3, basepairs 1080-2086
F3/R4, basepairs 1929-2940
F4/R5, basepairs 2754-3762
F5/R6, basepairs 3601-4442
F6/R12, basepairs 4301-4865
F12/R7, basepairs 4715-5216
F7/R8, basepairs 5047-6047
F8/R9, basepairs 5885-7070

Two-primer amplification of the SISPA-amplified cDNA
fragments with each Fi/Rj primer pair was carried out under
conditions similar to those described above, with about 25
cycles.

The amplified fragment mixtures from above were each
fractionated by agarose electrophoresis on 1.2% agarose,
and transferred to nitrocellulose filters (Southern) for
hybridization with radiolabeled internal oligonucleotide
probes as above. The analysis confirmed that each linking
fragment contained the two end-primer sequences from
adjacent overlap regions. The sequences contained in each
of the linking fragments are indicated in the Appendix.
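
The way the ten linking fragments tile the HCV sequence can
be illustrated with the short sketch below, which computes
each fragment's length and its overlap with the next fragment
from the spans listed in this Example. The names are
hypothetical and the coordinates are assumed to be inclusive.

```python
# Linking fragments in 5'->3' order with the inclusive spans listed in the text.
linking_fragments = [
    ("F1/R10", 183, 860),
    ("F10/R2", 576, 1273),
    ("F2/R3", 1080, 2086),
    ("F3/R4", 1929, 2940),
    ("F4/R5", 2754, 3762),
    ("F5/R6", 3601, 4442),
    ("F6/R12", 4301, 4865),
    ("F12/R7", 4715, 5216),
    ("F7/R8", 5047, 6047),
    ("F8/R9", 5885, 7070),
]

# Length of each linking fragment in basepairs.
fragment_lengths = {name: end - start + 1 for name, start, end in linking_fragments}

# Overlap (bp) between each fragment and the one that follows it.
overlaps = [
    (prev[0], nxt[0], prev[2] - nxt[1] + 1)
    for prev, nxt in zip(linking_fragments, linking_fragments[1:])
]

# The fragments tile the region without gaps if every overlap is positive.
gap_free = all(bp > 0 for _, _, bp in overlaps)
covered_bp = linking_fragments[-1][2] - linking_fragments[0][1] + 1

for name, length in fragment_lengths.items():
    print(f"{name}: {length} bp")
for left, right, bp in overlaps:
    print(f"{left} and {right} share {bp} bp")
print(f"gap-free: {gap_free}; region covered: {covered_bp} bp")
```

On these figures each pair of neighboring fragments shares a
stretch of sequence, so the set covers the region between
basepairs 183 and 7070 without gaps.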

Example 10

Preparing Cloned Peptide Fragments

A. DNA Fragment Digestion

Each of the ten linking fragments from Example 9 was
suspended in a standard digest buffer (0.5 M Tris HCl, pH
7.5; 1 mg/ml BSA; 10 mM MnCl2) to a concentration of about
1 mg/ml and digested with DNase I at room temperature for
various times (1-5 minutes). These reaction conditions
were determined from a prior calibration study, in which
the incubation time required to produce predominantly
100-300 basepair fragments was determined. The material was
extracted with phenol/chloroform before ethanol
precipitation.

The fragments in the digest mixture were blunt-ended
and ligated with EcoRI linkers. The resultant fragments
were analyzed by electrophoresis (5-10 V/cm) on 1.2% agarose
gels, using PhiX174/HaeIII and lambda/HindIII size markers.
The 100-300 bp fraction was eluted onto NA45 strips
(Schleicher and Schuell), which were then placed into 1.5
ml microtubes with eluting solution (1 M NaCl, 50 mM
arginine, pH 9.0), and incubated at 67°C for 30-60 minutes.
The eluted DNA was phenol/chloroform extracted and then
precipitated with two volumes of ethanol. The pellet was
resuspended in 20 µl TE buffer (0.01 M Tris HCl, pH 7.5,
0.001 M EDTA).

B. Cloning the Digest Fragments 

Lambda gt11 phage vector (Young et al.) was obtained
from Promega Biotec (Madison, WI). This cloning vector has
a unique EcoRI cloning site 53 base pairs upstream from the
beta-galactosidase translation termination codon. The
partial digest fragments from each linking fragment in Part
A were introduced into the EcoRI site by mixing 0.5-1.0 µg
EcoRI-cleaved lambda gt11, 0.3-3 µl of the above sized
fragments, 0.5 µl 10X ligation buffer (above), 0.5 µl DNA ligase
(200 units), and distilled water to 5 µl. The mixture was
incubated overnight at 14°C, followed by in vitro
packaging, according to standard methods (Maniatis, pp. 256-268).

The packaged phage were used to infect E. coli strain
KM392, obtained from Dr. Kevin Moore, DNAX (Palo Alto, CA).
Alternatively, E. coli strain Y1090, available from the
American Type Culture Collection (ATCC No. 37197), could be
used. A lawn of KM392 cells infected with about 10^3-10^4 pfu
of the phage stock from above was prepared on a 150 mm
plate and incubated, inverted, for 5-16 hours at 37°C. The
infected bacteria were checked for loss of beta-galactosidase
activity (clear plaques) in the presence of X-gal,
using a standard X-gal substrate plaque assay method
(Maniatis).

Identification of single plaques containing a digest-fragment
insert was confirmed as follows. Clear single
plaques (containing the progeny of a single phage) were
removed from the plate and suspended in extraction buffer
(Maniatis) to release the phage DNA. The phage extract was
added to the above DNA amplification mixture in the
presence of primers which are about 70 basepairs away in
either direction from the EcoRI site of lambda gt11. Thus
phage containing a digest-fragment insert will yield an
amplified digest fragment of about 140 basepairs plus
insert. Phage DNA amplification was carried out as described
above, with 25 cycles of amplification. The reaction
material from each plaque tested was fractionated on 1.5%
agarose, and examined for the size of amplified digest
fragments. Non-recombinant phage gave a 140 basepair band, and
recombinant phage gave a band of 140 basepairs plus the
length of the insert sequence. The results are shown in column
2 (REC Freq) of Table 3 below, for the six linking-fragment
libraries indicated in the first column of Table 3.
The denominator in the column-2 entries is the total number
of clear plaques assayed by primer amplification. The numerator
is the number of clear plaques containing fragment inserts.
Thus, 3/15 means that 3 plaques tested positive by PCR out
of a total of 15 clear plaques assayed.
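
The size-based scoring described above can be sketched in a few lines of Python. This is only an illustration: the band sizes, the 10 bp tolerance, and the function names are assumptions introduced here, not values or procedures taken from the text; only the 140 basepair non-recombinant product and the 3/15 style of reporting come from the description.

```python
# Size of the PCR product from non-recombinant phage: the two primers
# sit about 70 bp on either side of the EcoRI site (from the text).
FLANK_BP = 140


def insert_size(band_bp, tolerance_bp=10):
    """Return the inferred insert length in bp, or 0 for a non-recombinant band.

    tolerance_bp is a hypothetical allowance for gel sizing error.
    """
    extra = band_bp - FLANK_BP
    return extra if extra > tolerance_bp else 0


def recombinant_frequency(band_sizes_bp):
    """Return (plaques with insert, plaques assayed)."""
    recombinants = sum(1 for b in band_sizes_bp if insert_size(b) > 0)
    return recombinants, len(band_sizes_bp)


# Hypothetical gel readings for 15 clear plaques, three of which carry inserts.
bands = [140] * 12 + [290, 350, 410]
num, den = recombinant_frequency(bands)
print(f"REC Freq = {num}/{den}")
```

With these made-up readings the result has the same form as the F2R3 #2 entry in Table 3 (3 of 15 plaques carrying an insert).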
Table 3

Library(1)    REC Freq(2)    1° Screen(3)    PA/REC(4)

F2R3 #2       3/15           2               0.33
F3R4 #1       7/12           0
F4R5 #3       9/10           10              0.37
F5R6 #5       11/12          37              1.35
F7R8 #7       0/12           1
F8R9 #10      3/12           58              7.73

1 - Libraries constructed by partial DNase I digestion of indicated
    linking clone
2 - Recombinant frequency determined by PCR with insert-flanking lambda
    gt11 primers
3 - Primary screening with chronic human PT-NANBH serum (1:100) on
    1.5X10[exponent illegible] phage
4 - PA/REC indicates the number of positive areas detected per actual
    number of recombinants plated

The library of digest fragments constructed for each
linking fragment was screened for expression of peptides
which are immunoreactive with a human PT-NANBH serum. The
lawn of phage-infected bacteria was overlaid with a
nitrocellulose sheet, transferring PT-NANBH recombinant peptides
from the plaques to filter paper. The plate and filter
were indexed for matching corresponding plate and filter
positions.

The filter was removed after 6-12 hours, washed three
times in TBS buffer (10 mM Tris, pH 8.0, 150 mM NaCl),
blocked with AIB (TBS buffer with 1% gelatin), washed again
in TBS, and incubated overnight with antiserum (diluted
to 1:100 in AIB, 12-15 ml/plate). The sheet was washed
twice in TBS and then incubated with alkaline-phosphatase-
conjugated anti-human IgG to attach the labeled antibody at
filter sites containing antigen recognized by the
antiserum. After a final washing, the filter was developed in
a substrate medium containing 33 µl NBT (50 mg/ml stock
solution maintained at 4°C) mixed with 16 µl BCIP (50 mg/ml
stock solution maintained at 4°C) in 5 ml of alkaline
phosphatase buffer (100 mM Tris, pH 9.5, 100 mM NaCl, 5 mM MgCl2).
Reacted substrate precipitated at points of antigen
production, as recognized by the antiserum.

The total number of plaques which showed antigen-positive
reaction (positive areas, PA) in the primary screen is
given in the third column in Table 3. The fourth column in
the table is the frequency of positive areas per total
number of recombinant phage screened (x 10^3). This last column
is therefore a measure of the relative immunogenicity of
antigen expressed from a particular linking fragment using
this particular serum sample.
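
As a worked illustration of the fourth-column normalization, the sketch below divides the positive areas by the estimated number of recombinant phage plated. The plated-phage count of 30,000 is an assumption chosen for illustration (the exponent in footnote 3 of Table 3 is not legible), and the function name is hypothetical.

```python
def pa_per_recombinant(positive_areas, rec_num, rec_den, phage_plated):
    """Positive areas per recombinant phage plated.

    rec_num/rec_den is the PCR-determined recombinant frequency
    (column 2 of Table 3). Returns None when no recombinants were
    detected, since the ratio is then undefined.
    """
    if rec_num == 0:
        return None
    recombinants = phage_plated * rec_num / rec_den
    return positive_areas / recombinants


# ASSUMPTION: total phage plated per library; not stated legibly in the text.
PHAGE_PLATED = 30_000

# F2R3 #2 row of Table 3: 2 positive areas, recombinant frequency 3/15.
value = pa_per_recombinant(2, 3, 15, PHAGE_PLATED)
print(round(value * 1e3, 2))  # PA/REC expressed x 10^3
```

Under this assumed phage count the F2R3 #2 row works out to 0.33 (x 10^3), and a library with a recombinant frequency of 0/12 has no defined PA/REC value.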

Example 11

Screening Digest Fragments
The digest-fragment libraries of each of the ten linking
fragments from Example 9 were screened with sera from
a human patient with chronic PT-NANBH and with pooled sera
from chimpanzees with acute PT-NANBH infection and chronic
PT-NANBH infection. Individual chronic and acute chimpanzee
sera from 5 chimpanzees were obtained from the Centers
for Disease Control.

The digest-fragment libraries from the linking fragments
indicated in Table 4 below were screened with each of
the three sera, using the screening procedure described in
Example 10. The total number of positive areas observed in
each plate (making up one fragment library) is given in the
table. The entries in the table which are not in parentheses
represent the number of positive areas which were
confirmed by plaque purification, i.e., by replating
plaques from the positive areas at low dilution and
confirming a positive area (secondary screen). Typically
about 90-95 percent of the positive areas in the primary
screen tested positive by secondary screening. The entries
in parentheses indicate positive areas which have not been
confirmed in a secondary screen.

As seen from Table 4, all but one of the linking fragment
libraries contained sequences encoding peptide
antigens which are immunoreactive with either chronic human
or chimpanzee infected sera. Five of the libraries contain
sequences encoding antigens which are immunoreactive with
acute sera, indicating that one or more of the antigens in
this group are effective to detect acute-infection serum.
Three of these latter libraries (Fj/R*, F12/R7, and F7/R8)
gave over 10 positives in each library. These data are not
corrected for the recombinant frequency in a particular
library and therefore do not reflect the comparative
immunogenicity of the various linking fragments.

Table 4

Acute Pool 
P.P. Clones 



Chronic Pool 
P.P. Clones 



F1R10 

F10R2 

F2R3 

F3R4 

F4R5 

F5R6 

F6R12 

F12R7 

F7R8 

F8R9 



0 


0 


4 


2 


4 


0 


0 


10 


5 


0 


34 


0 


(400) 


5 


2 


17(200) 


0 


20 


60 


0 



( )           = not plaque purified
P.P.          = Plaque Pure
Acute Pool    = CDC Panel of Chimps
Chronic Pool  = CDC Panel of Chimps



0 
4 
1 
10 
7 

(42) 
10(200) 
9(200) 
10(130) 
1 






Example 12
Immunoscreening for 409-1-1 Antigen
A. Plaque Immunoscreening

Several clear plaques identified in the primary screen
of the F4/R5 linking fragment were replated and plaque
purified; one of the purified plaques was designated
gt11/409-1-1(c-a). The digest fragment contained in clone
409-1-1(c-a) corresponds to two sets of base pairs present in
the HCV genome and present in clone 409-1-1(abc). For ease of
reference, three regions (a, b, and c) have been designated
in the 409-1-1(abc) clone (see below and Figure 5). The
longest homology of base pairs corresponds approximately to
nucleotides 2754 to 3129 of the Appendix (the "a" region,
see Figure 5; region delineated by boxes) and the shorter
homology corresponds approximately to nucleotides 3242 to
3311 of the Appendix (the "c" region, see Figure 5);
normally the "c" region is located approximately 112
nucleotides distal to the 3' end of the "a" region (see
Figure 5). The complete sequence of the gt11/409-1-1(c-a)
insert is given in Figure 6 and presented as SEQ ID NO: 7.
This clone arose through a ligation event between two
independent DNase I fragments generated from the F4/R5
linking clone and has ATCC No. 40792. A related clone,
designated 409-1-1(abc), has been described in co-owned
patent application Ser. No. 505,611 and has ATCC No. 40876.

A lambda gt11 clone corresponding to the immunoreactive
sequence reported in the EPO application 88310922.5,
and designated 5-1-1, was prepared by primer-specific
amplification of the amplified cDNA fragments generated in
Example 7. The 5-1-1 sequence corresponds to basepairs
3730-3858 of the HCV sequence (Appendix), in the linking
fragment F5/R6. The primers used for fragment amplification
are 20 basepair oligomers complementary to the forward and
reverse sequences of the 3732-3857 basepair 5-1-1 sequence.
Both oligomers have EcoRI sites incorporated into their
ends and the forward oligomer is designed to ensure a
contiguous open reading frame with the beta-galactosidase gene.
The amplified 5-1-1 sequence was purified by agarose gel
electrophoresis, and cloned into lambda gt11 phage.
Amplification and cloning methods were as described above.
Phage containing the 5-1-1 sequence were identified and
purified by primary and secondary screening, respectively,
with human PT-NANBH serum, also as described above.
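
A quick arithmetic check on the 5-1-1 coordinates quoted above can be sketched as follows. The helper names are hypothetical. A whole number of codons is only a necessary condition for a contiguous reading frame; the actual fusion frame also depends on how the forward primer positions the EcoRI site, which the text does not spell out.

```python
def region_length(start, end):
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def whole_codons(length_bp):
    """True if the region contains a whole number of codons."""
    return length_bp % 3 == 0


length = region_length(3730, 3858)  # 5-1-1 coordinates from the text
print(length, length // 3, whole_codons(length))
```

The 3730-3858 range is 129 basepairs long, i.e. 43 codons, so an insert of that length can in principle be carried in frame.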

The purified gt11/409-1-1(c-a) and gt11/5-1-1 clones
were each mixed with negative lambda gt11 phage, plated and
immunoscreened with a number of different donor sera from
normal and NANBH-infected humans and chimpanzees, as
indicated in Table 5 below. Each plate was divided into
several equal-area sections, and the corresponding sections on
the nitrocellulose transfer filter were separately screened
with the donor sera indicated, using the immunoscreening
method described in Example 11. The number of positives
detected for each group of sera by the 5-1-1 and
409-1-1(c-a) peptides is shown, as well as a comparison with
the C-100 test in the ELISA format, in Table 5.



Table 5

                                          # Positive
Source   Diagnosis   # Donors   5-1-1   409-1-1(c-a)   C-100

Human    Normal      2          0       0              NT
Human    NANB        6          4       5              0/1*
Chimp    Normal      7          0       0              0/5
Chimp    Acute       5          1       3              0/5
Chimp    Chronic     8          7       7              5/5

NT, not tested; * only [illegible] of five sera tested.



B. Western Blot Screening

For Western blot screening, gt11/409-1-1(c-a) phage from Example
11 was used to infect E. coli BNN103 temperature-sensitive bacteria.
These bacteria were obtained from the American Type Culture Collection.
The bacterial host allows expression of a beta-galactosidase/peptide
antigen fused protein encoded by the vector under temperature induction
conditions (Hunyh).

Infected bacteria were streaked, grown at 32°C overnight or until
colonies were apparent, and individual colonies were replica plated and
examined for growth at 32°C and 42°C. Bacterial colonies which grew at
32°C, but not 42°C, indicating integration of the phage genome, were
used to inoculate 1 ml of NZYDT (Maniatis) broth. A saturated overnight
bacterial culture was used to inoculate a 10 ml culture, which was
incubated with aeration to an O.D. of about 0.2 to 0.4, typically
requiring 1 hour incubation. The culture was then brought to 43°C
quickly in a 43°C water bath and shaken for 15 minutes to induce lambda
gt11 peptide synthesis, and incubated further at 37°C for 1 hour.

The cells were pelleted by centrifugation, and 1 ml of the
pelleted material was resuspended in 100 µl of lysis buffer (62 mM
Tris, pH 7.5 containing 5% mercaptoethanol, 2.4% SDS and
10% glycerol). Aliquots (about 15 µl) were loaded directly
onto gels and fractionated by SDS-PAGE. After electrophoresis,
the fractionated bands were transferred by electroelution
to nitrocellulose filters, according to known
methods (Ausubel et al.).

The lysate was treated with DNase I to digest bacterial
DNA, as evidenced by a gradual loss of viscosity in the
lysate. An aliquot of the material was diluted with Triton
X-100™ and sodium dodecyl sulfate (SDS) to a final
concentration of 2% Triton X-100™ and 0.5% SDS. Non-solubilized
material was removed by centrifugation and the supernatant
was fractionated by SDS polyacrylamide electrophoresis
(SDS-PAGE). A portion of the gel was stained, to
identify the peptide antigen of interest, and the
corresponding unstained band was transferred onto a
nitrocellulose filter.

The 5-1-1 antigen coding sequence (Example 11) was
also expressed as a glutathione-S-transferase fusion
protein using the pGEX vector system, according to
published methods (Smith). The fusion protein obtained from
bacterial lysate and fractionated by SDS-PAGE was
transferred to a nitrocellulose filter for Western blotting, as
above.

Western blotting was carried out substantially as
described in Example 10. Briefly, the filters were blocked
with AIB, then reacted with the serum samples identified in
Table 5, including human and chimpanzee normal, chronic
NANBH, and hepatitis B (HBV) serum samples. The presence of
specific antibody binding to the nitrocellulose filters was
assayed by further immunobinding of alkaline-phosphatase
labelled anti-human IgG. The results of the Western blot
analysis with the Sj26/5-1-1 and beta-gal/409-1-1(c-a)
fusion proteins are shown in Table 6. The data confirm
that the 409-1-1(c-a) and 5-1-1 peptide antigens are
specifically immunoreactive with human and chimpanzee NANBH
antisera.

Table 6

                                       # Positive
Source   Diagnosis   # Donors   Sj26/5-1-1   beta-gal/409-1-1(c-a)

Human    Normal      2          0            0
Human    NANB        7          5            5
Human    HBV         1          0            0
Chimp    Normal      5          0            0
Chimp    NANB        6          5            5
Chimp    HBV         1          0            0



Example 13
Generation of Alternative Clones
Alternative clones were generated from the region
identified in Example 12 as encoding antigen specifically
immunoreactive with human and chimpanzee NANBH antisera. The
primers shown in Table 7 were selected from the HCV or
409-1-1(abc) coding sequences to generate a variety of
overlapping clones.

Table 7

Primer           Sequence

33C-F1           CCGAATTCGCGGTGGACTTTATCCCTGT
33C-R1           CCGAATTCCAGAGCAACCTCCTCGATG
409-1-1(c-a)F    CCGAATTCCGCACGCCCGCCGAGACTAC
409-1-1-F1       CCGAATTCTCCACCACCGGAGAGATCCC
409-1-1-R2       CCGAATTCCACACGTATTGCAGTCTATC
409-1-1-F3       CCGAATTCGTCACCCAGACAGTCGAT
409-1-1-R5       CCGAATTCCCCTCCCAAAATTCAAGATGG
409-1-1(c-a)R    CCGAATTCGCCAGTCCTGCCCCGACGTT
409-1-1CR        CCGAATTCGTCCTGGCACACGGGAAG
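
Every primer in Table 7 begins with the same eight bases, which contain an EcoRI recognition site (GAATTC). The sketch below, offered only as an illustration, splits each primer into its leading bases, the EcoRI site, and the remaining template-specific portion. The function name and the splitting convention are assumptions made here; the sequences are copied from Table 7.

```python
ECORI_SITE = "GAATTC"

# Primer sequences as listed in Table 7.
PRIMERS = {
    "33C-F1": "CCGAATTCGCGGTGGACTTTATCCCTGT",
    "33C-R1": "CCGAATTCCAGAGCAACCTCCTCGATG",
    "409-1-1(c-a)F": "CCGAATTCCGCACGCCCGCCGAGACTAC",
    "409-1-1-F1": "CCGAATTCTCCACCACCGGAGAGATCCC",
    "409-1-1-R2": "CCGAATTCCACACGTATTGCAGTCTATC",
    "409-1-1-F3": "CCGAATTCGTCACCCAGACAGTCGAT",
    "409-1-1-R5": "CCGAATTCCCCTCCCAAAATTCAAGATGG",
    "409-1-1(c-a)R": "CCGAATTCGCCAGTCCTGCCCCGACGTT",
    "409-1-1CR": "CCGAATTCGTCCTGGCACACGGGAAG",
}


def split_primer(seq):
    """Split a primer into (leading bases, EcoRI site, template-specific part)."""
    idx = seq.find(ECORI_SITE)
    if idx < 0:
        raise ValueError("no EcoRI site found in primer")
    end = idx + len(ECORI_SITE)
    return seq[:idx], seq[idx:end], seq[end:]


for name, seq in PRIMERS.items():
    lead, site, specific = split_primer(seq)
    print(f"{name:15s} {lead} | {site} | {specific} ({len(specific)} nt)")
```

A check of this kind is a convenient way to catch transcription errors in a primer list before ordering or comparing sequences.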

The primers shown in Table 7 were used in DNA amplification
reactions as described in Examples 7B and 8; the primers
and templates used in each reaction are shown in Table
8. The amplified fragments were then treated with the
Klenow fragment of DNA polymerase I, under standard
conditions (Maniatis et al.), to fill in the ends of the
molecules. The blunt-end amplified fragments were digested
with EcoRI under standard conditions and cloned into lambda
gt11 expression vectors essentially as described in Example
10B. The resulting inserts are aligned for comparison in
Figure 7.



Table 8

Generated Fragment   Template             Primers

33C                  cDNA*                33-C-F1 and 409-1-1-R2
33CU                 cDNA*                33-C-F1 and 33-C-R1
409-1-1(F1R2)        gt11 409-1-1(c-a)    409-1-1-F1 and 409-1-1-R2
409-1-1(a)           gt11 409-1-1(c-a)    409-1-1-F1 and 409-1-1caR
409-1-1(c)           gt11 409-1-1(c-a)    409-1-1caF and 409-1-1CR
409-1-1(c+270)       gt11 409-1-1(c-a)    409-1-1caF and 409-1-1-R2
409-1-1u             gt11 409-1-1(c-a)    409-1-1-F3 and 409-1-1caR

* Amplified cDNA fragments from Example 7

Example n

Immunoscreening of the Alternative Clones
The alternative clones generated in Example 12 were
immunoscreened essentially as described in Example 10B.
Clones 409-1-1(abc) and 409-1-1(c-a), generated in Example
12, were also included in the following immunoscreenings.
The results of the preliminary immunoscreening are shown in
Table 9.

Table 9

Clone              GLI-1    FEC

33C                +        ND*
33cu               +        ND
409-1-1(abc)       +        ND
409-1-1(F1R2)      +        ND
409-1-1(a)         +        ND
409-1-1(ca)        +        ND
409-1-1(c)         -
409-1-1(c+270)     +        ND
409-1-1u           -

*Not Done



The GLI-1 serum was a human chronic PT-NANBH serum. If a
clone tested negative with GLI-1 it was further examined by
screening with FEC, a human chronic PT-NANBH serum.

The seven of the nine alternative clones which tested
positive by the above preliminary immunoscreening were more
extensively screened against a battery of sera. In addition,
clone C100 (see Background) was included in the screening.
The results of this more exhaustive screening are presented
in Table 10.

Table 10

Serum                            ANTIGEN

                          409-1-1  409-1-1  409-1-1  409-1-1  409-1-1
          C100  33C  33Cu   abc      F1R2     a       c+270     ca     5-1-1





SKF(-) 


• 


• 


- 


- 


- 


- 


• 


- 


10 


FEC{ + ) 


+ 


+ 3 


+ 3 


+ 1 


+ 2 


+ 2 


- 


+ 2 




BV 


+ 2 


+ 3 


I 


+ 1 


+ 1 


• 


+ 1 


- 




Bar 


+ 2 


+ 2 


I 


- 


- 


- 


• 


- 




PP(-) - 




■ 


■ 


• 


• 


- 


- 


- 




AP 


+ 1 


+ 2 




I 


• 


- 


I 


■ 




CP + 


+ 2 


+ 3 


+ 2 


+ 3 


+ 3 


1 


+ 3 


+ 2 




1 

2 


* 
- 


- 
- 


- 
- 


- 
- 


- 

- 


- 
- 


- 
- 




20 


3 




















4 






I 


1 












5 




+ 1 
















6 


+ 1 


+ 3 


+ 1 


+ 1 


+ 1 




+ 1 


+ 1 




7 


+ 2 


+ 3 


+ 1 


+ 2 


+ 2 




+ 2 


+ 1 


25 


38 




1 


+ 1 


I 


1 






1 




39 




+ 1 


1 


+ 1 






1 


1 




40 + 


+ 1 


+ 2 


+ 1 


+ 1 




1 


+ 1 


+ 1 




41 + 


+ 2 


+ 3 


+ 1 


+ 1 


+ 1 




+ 2 


+ 1 




42 + 


+ 2 


+ 3 


+ 1 


+ 1 


+ 1 




+ 2 


+ 1 


30 


43 




















44 


I 


I 
















45 


I 


+ 1 


I 


I 






! 


1 




46 + 


+ 1 


+ 2 


+ 1 


+ 2 


+ 1 




+ 1 


1 




47 + 


+ 1 


+ 2 


+ 2 


+ 2 


+ 3 


m 


+3 


+1 


35 


B18 - 


+3 


+3 


+1 


+3 


+3 




+3 






A7 


+3 


+3 


+1 


+1 


+3 




+3 


+3 




C7 


+2 


+3 
















A3 


+3 


+3 


+1 


+2 


+1 




+2 






B7 


+2 


+3 


I 


+3 


+3 




+3 


I 


40 


C12 + 


+2 


+3 















The serum samples used for screening were identified as
follows: SKF, PT-NANBH negative; FEC, PT-NANBH positive;
BV, community acquired NANBH; Bar, PT-NANBH positive; PP
(pre-inoculation pooled chimpanzee serum), PT-NANBH
negative; AP (acute HCV pooled chimpanzee serum), PT-NANBH
positive; and CP (chronic HCV pooled chimpanzee serum),
PT-NANBH positive. The numbered serum samples correspond to
human clinical serum samples which were PT-NANBH positive.
The PP, CP, and AP sera were pooled serum samples from 5
different chimpanzees; the chimpanzee serum samples were
obtained from the Centers for Disease Control. The scoring
system presented in Table 10 is a qualitative scoring
system defined as follows: (-), a clear negative; (+),
(1+), (2+), (3+), increasing strength of positive signal,
with (3+) being the strongest signal; and (I) stands for
Indeterminate, where two readings were different and not
repeated.
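
The qualitative scale just defined can be treated as an ordinal ranking when tallying results. The sketch below is an illustration only: the numeric ranks, the handling of the "+2" style used in the table body, and the example readings are assumptions introduced here, not data from Table 10.

```python
# Ordinal ranks for the qualitative scores, weakest to strongest.
RANK = {"-": 0, "+": 1, "1+": 2, "2+": 3, "3+": 4}


def normalize(score):
    """Accept either '2+' or '+2' / '+ 2' spellings and return the '2+' form."""
    s = score.strip().replace(" ", "")
    if len(s) == 2 and s[0] == "+" and s[1].isdigit():
        s = s[1] + "+"
    return s


def rank(score):
    """Return the ordinal rank, or None for an indeterminate (I) reading."""
    s = normalize(score)
    if s == "I":
        return None
    return RANK[s]


def positive_fraction(scores):
    """Return (positive readings, determinate readings), ignoring 'I'."""
    ranks = [rank(s) for s in scores]
    determinate = [r for r in ranks if r is not None]
    positives = sum(1 for r in determinate if r > 0)
    return positives, len(determinate)


# Hypothetical readings for one antigen across five sera.
readings = ["-", "+ 2", "3+", "I", "+"]
print(positive_fraction(readings))
```

Indeterminate readings are excluded from both counts here; treating them as negatives would be an equally defensible convention and would lower the apparent sensitivity.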

In view of the data presented in Table 10 the sensitivity
of the antigens in terms of immunoscreening is 33cu >
33c > 409-1-1(c-a) > 409-1-1-F1R2 > 409-1-1(abc) & 409-1-1a
> 5-1-1 > 409-1-1(c+270). Although 33cu and 33c were
sensitive antigens, they reacted with high background against
all sera. Accordingly, the 409-1-1 series are more useful
as diagnostic antigens since they are more specific to HCV
induced antibodies.

The immunoscreening was further extended to include the
clone 36 and 45 (corresponds to clone 40) encoded epitopes
which were identified above. Table 11 shows the results of
the immunoscreening.



Table 11

PANEL I: SEROCONVERSION SPECIMENS

                              ANTIGEN
SERUM        C-100   33C   5-1-1   409-1-1(c-a)   36   45   gt11


GLI-1 


+ 


4+ 


2+ 


4+ 




3+ 


FEC 


+ 


4+ 


3+ 


4+ 


3+ 




BV 




3+ 




3+ 






SKF(norm) 














1-N01/D69 




I 










2- "/D124 




+
3- M /D146 -

4- »/D211 - 

5- N0O/D22 
5 6- "/D29 

7- M /D41 

8- »/D60 

9- "/D137 + 

10 10-N240/D0 - 

11- M /D45 - 

12- H /D71 - 

13- M /D89 - 

14- »/D106 - 
15 15- »/D155 - 

16- N228/D0 - 

17- »/D31 - 

18- »/D41 - 
20 19- "/D51 - 

20- »7D73 - 

21- M /D93 - 

22- "/D127 - 

25 23-N192/D114 

24- »/D184 - 

25- »/D224 - 

26- »/D280 - 

30 27-N176/D0 - 

28- »/D66 - 

29- "/D77 

30- »/D94 - 

31- »/D200 - 

35 

32- N170/D0 - 

33- »/D27 - 

34- »/D49 - 
40 35- "/D64 - 

36- "/D183 - 

37- »/D278 - 





71 






I 


- 






+ 




mm 


mm 


+ 


i 


I 


— 


2+ 




04. 




3+ 




3+ 




44- 




4+ 




4+ 


4+ 


4+ 


2+ 


I 




X 




I 




T 

J. 




I 








I 




- 


- 


I 








I 


- 


- 


- 


I 








I 


- 






I 


— 


- 


- 


T 
X 










I 














I 









4

                              ANTIGEN
SERUM        C-100   33C   5-1-1   409-1-1(c-a)   36   45   gt11



I 
+ 



38- N144/D63 - I 

39- "/D72 - I _ _ 

40- "/D91 + 2+ + 2+ - 
10 41- «/D289 + 4+ + 3+ ,+ 

42- «/D233 + 4+ 3 + 4+ 2+ 

43- N122/D0 I 

44- M /D51 - I I 
15 45- »/D57 -2+1 

46- »7D72 + 2+ 3+ T 

47- ^«/D94 + 3+ + 4+ + 

48- VD199 + 4+ 2+ 4+ + 

20 49-N31/D0 - I 

50- "/D140 - _ Z 

51- "/D154 - - 

52- "/D170 - 

53- "/D210 - 
25 54- n /D266 - 

55 - "/D336 - - ~_ 

56- "/D394 - _ _ 

57- N16/D0 - 
30 58- "/D47 - _ 

59- »/D62 - _ I 

60- »/D83 - 

61- «/D137 - 
61- "/D167 - - 

35 63- "/D197 - _ _ 

64- M /D370 - - _ 



The screening sera GLI-1, FEC, BV, and SKF have been
defined above. The numbered serum samples correspond to
human clinical serum samples which were PT-NANBH positive;
these samples were obtained from Dr. Francoise Fabiani-
Lunel, Hospital La Pitie Salpetriere, Paris, France. As
can be seen from the results presented in Table 11, the
antigens produced by clones 36 and 40, while not as
sensitive as 409-1-1(c-a), do yield HCV-specific immunopositive
signals.

Example 14
Isolation of 409-1-1 Fusion Protein

Sepharose 4B beads conjugated with anti-beta-galactosidase
were purchased from Promega. The beads were packed
in a 2 ml column and washed successively with phosphate-
buffered saline with 0.02% sodium azide and 10 ml TX buffer
(10 mM Tris buffer, pH 7.4, 1% aprotinin).

BNN103 lysogens infected with gt11/409-1-1(c-a) from
Example 12 were used to inoculate 500 ml of NZYDT broth.
The culture was incubated at 32°C with aeration to an O.D.
of about 0.2 to 0.4, then brought to 43°C quickly in a 43°C
water bath for 15 minutes to induce gt11 peptide synthesis,
and incubated further at 37°C for 1 hour. The cells were
pelleted by centrifugation and suspended in 10 ml of lysis
buffer (10 mM Tris, pH 7.4 containing 2% Triton X-100™ and
1% aprotinin added just before use). The resuspended cells
were frozen in liquid nitrogen, then thawed, resulting in
substantially complete cell lysis. The lysate was treated
with DNase I to digest bacterial and phage DNA, as evidenced
by a gradual loss of viscosity in the lysate. Non-solubilized
material was removed by centrifugation.

The clarified lysate material was loaded on the
Sepharose column, the ends of the column were closed, and
the column was placed on a rotary shaker for 2 hours at room
temperature and 16 hours at 4°C. After the column settled,
it was washed with 10 ml of TX buffer. The fused protein
was eluted with 0.1 M carbonate/bicarbonate buffer, pH 10.
A total of 14 ml of the elution buffer was passed through
the column, and the fusion protein eluted in the first 4-6
ml of eluate.

The first 6 ml of eluate from the affinity column were
concentrated in Centricon™-30 cartridges (Amicon, Danvers,
Mass.). The final protein concentrate was resuspended in
400 µl PBS buffer. Protein purity was analyzed by SDS-PAGE.
A single prominent band was observed.

Example 15

Preparation of Anti-409-1-1(c-a) Antibody
The 409-1-1(c-a) digest fragments from lambda gt11
were released by EcoRI digestion of the phage, and the "A"
region purified by gel electrophoresis. The purified
fragment was introduced into the pGEX expression vector
(Smith). Expression of glutathione S-transferase fused
protein (Sj26 fused protein) containing the 409-1-1(a)
peptide antigen was achieved in E. coli strain KM392
(above). The fusion protein was obtained from lysed
bacteria and isolated by affinity chromatography on a
column packed with glutathione-conjugated beads, according
to published methods (Smith).

The purified Sj26/409-1-1(a) fused protein was
injected subcutaneously in Freund's adjuvant in a rabbit.
Approximately 1 mg of fused protein was injected at days 0
and 21, and rabbit serum was collected on days 42 and 56.

A purified Sj26/5-1-1 fused protein was similarly
prepared using an amplified HCV fragment encoding the
5-1-1 fragment. The fused Sj26/5-1-1 protein was used to
immunize a second rabbit, following the same immunization
schedule. A third rabbit was similarly immunized with
purified Sj26 protein obtained from control bacterial
lysate.

Minilysates from the following bacterial cultures were
prepared as described in Example 12: (1) KM392 cells
infected with pGEX, pGEX containing the 5-1-1 insert, and
pGEX containing the 409-1-1(a) insert; and (2) BNN103
infected with lambda gt11 containing the 5-1-1 insert and
gt11 containing the 409-1-1(c-a) insert. The minilysates
were fractionated by SDS-PAGE, and the bands transferred to
nitrocellulose filters for Western blotting as described in
Example 12. Table 12 shows the pattern of immunoreaction
which was observed when the five lysate preparations
(containing the antigens shown at the left in the table) were
screened with each of the three rabbit immune sera.
Summarizing the results, serum from control (Sj26) rabbits was
immunoreactive with each of the Sj26 and Sj26 fused protein
antigens. Serum from the animal immunized with Sj26/5-1-1
fused protein was reactive with all three Sj26 antigens
and with the beta-gal/5-1-1 fusion protein, indicating the
presence of specific immunoreaction with the 5-1-1 antigen.
Serum from the animal immunized with Sj26/409-1-1(a) fused
protein was reactive with all three Sj26 antigens and with
the beta-gal/409-1-1(c-a) fusion protein, indicating the
presence of specific immunoreaction with the 409-1-1(a)
antigen. None of the sera were immunoreactive with beta-
galactosidase (obtained from a commercial source).



Table 12

                                      Antibody
Antigens                   Sj26    5-1-1/Sj26    409-1-1(a)/Sj26

Sj26                       +       +             +
5-1-1 (Sj26)               +       +             +
5-1-1 (beta-gal)           -       +             -
409-1-1(a) (Sj26)          +       +             +
409-1-1(c-a) (beta-gal)    -       -             +

Anti-409-1-1(a) antibody present in the sera from the
animal immunized with the Sj26/409-1-1(a) is purified by
affinity chromatography, following the general procedures
described in Example 12, but where the ligand derivatized
to the Sepharose beads is the purified beta-gal/409-1-1(c-a)
fusion protein, rather than the anti-beta-galactosidase
antibody.

Example 16

Cloning the HCV Capsid Protein Coding Sequences
This example describes the cloning of HCV coding
sequences which encode the N-terminal region of the HCV
capsid protein.

The protein sequence of the HCV-capsid associated
antigen corresponds to the nucleotide residues 325-970 of
the full length HCV sequence (see Appendix A). The
following sequences were used as PCR primers to clone this
region: SF2(C), 5' end starting at nucleotide 325 of the
full length HCV sequence (Appendix),
5'-GCGCCCATGGGCACGATTCCCAAACCTCA; and SR1(C), 3' end starting
at nucleotide 969 of the full length HCV sequence (Appendix),
5'-GCCGGATCCCTATTACTC(G/A)TACACAAT(A/G)CT(C/T)GAGTT(A/G)G. The
anticipated size of the fragment generated using the
SF2(C)/SR1(C) primer pair was 644 base pairs.
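
The SR1(C) primer is written with four two-fold degenerate positions, so it stands for a small pool of oligonucleotides. The sketch below expands that notation and checks for the NcoI (CCATGG) and BamHI (GGATCC) sites used later for cloning. It is an illustration under stated assumptions: the primer strings are transcribed from the text with the end-of-line hyphens read as line-wrap marks, and the function name is hypothetical.

```python
import itertools
import re

# Primer sequences transcribed from the text (line-wrap hyphens removed;
# this reading of the hyphens is an assumption).
SF2_C = "GCGCCCATGGGCACGATTCCCAAACCTCA"
SR1_C = "GCCGGATCCCTATTACTC(G/A)TACACAAT(A/G)CT(C/T)GAGTT(A/G)G"

NCOI_SITE = "CCATGG"
BAMHI_SITE = "GGATCC"


def expand_degenerate(primer):
    """Expand '(X/Y)' alternatives into every concrete sequence."""
    # re.split with a capturing group keeps the alternatives at odd indices.
    parts = re.split(r"\(([ACGT](?:/[ACGT])+)\)", primer)
    choices = [p.split("/") if i % 2 else [p] for i, p in enumerate(parts)]
    return ["".join(combo) for combo in itertools.product(*choices)]


variants = expand_degenerate(SR1_C)
print(len(variants), "SR1(C) variants")
print("NcoI site in SF2(C):", NCOI_SITE in SF2_C)
print("BamHI site in every SR1(C) variant:",
      all(BAMHI_SITE in v for v in variants))
```

Four two-way positions give 2^4 = 16 concrete sequences, each carrying the BamHI site near its 5' end.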

SISPA-amplified cDNA fragments from Example 7 were
mixed with 100 µl Buffer A, 1 µM of equal molar amounts of
each of the SF2(C) and SR1(C) primers given above, 200 µM
each of dATP, dCTP, dGTP, and dTTP, and 2.5 units of Thermus
aquaticus DNA polymerase (Taq polymerase), as in Example 8.

Specific amplification of the SISPA-amplified cDNA
fragments with the capsid primer pair given above was
carried out under conditions similar to those described in
Example 7, with 1 minute at 72°C and about 30 cycles.

The amplified fragment mixtures from above were each
fractionated by agarose gel electrophoresis on duplicate
1.2% agarose gels, and one of the gels transferred to
nitrocellulose filters (Southern) for hybridization
with a radioactively labelled oligonucleotide (Southern)
having the following sequence: SF3(M/E), 5' end starting
at nucleotide 792 of the full length HCV sequence
(Appendix), 5'-GCGCCCATGGTTCTGGAAGACGGCGTG. This oligonucleotide
corresponds to a sequence internal to the amplification
product generated by using the SF2(C) and SR1(C) primers.
Eight out of 15 PCR products were identified which gave a
positive hybridization signal with the internal probe.

The vectors pGEX (Example 15) and pET (NOVAGEN, 565
Science Drive, Madison, WI 53711) were chosen for bacterial
expression of protein sequences encoded by the inserts.
The pGEX vector provided expression of the inserted coding
sequences as fusion proteins to Sj26 (see Examples 12 and
15) and the pET vector provided expression of the cloned
sequences alone. To clone the capsid sequences, the
amplification product bands were excised from the duplicate
gel. The DNA was extracted from the agarose and doubly
digested with NcoI and BamHI. A pGEX vector containing the
BamHI/NcoI cloning sites was also doubly digested with
BamHI and NcoI. The vector and extracted DNA were then
ligated under standard conditions and the ligation mixture
transformed into bacterial cells.

The bacterial transformants were cultured under
ampicillin selection, and the plasmid DNA isolated by
alkaline lysis (Maniatis et al.). The isolated plasmid DNA
was digested with NcoI and BamHI. The digestion products
were then electrophoretically separated on an agarose gel.
The gel was transferred to nitrocellulose and probed with
radioactively labelled SF3 as above. Twelve clones were
confirmed to have the insert of interest by the Southern
blot analysis.
Clones were generated in the pET vector in essentially
the same manner.

Example 17

Immunological Screening of the Putative HCV
Capsid Protein Clones
This example describes the immunological screening of
the putative HCV capsid protein clones which were obtained
in Example 16.

Of the twelve clones obtained in Example 16, protein 
mini-lysates of 7 clones (clones # 8, 14, 15, 56, 60, 65, 
and 66) were prepared as described in Example 12. These 
mini-lysates were fractionated as described and transferred 
to nitrocellulose for Western Blot analysis. Table 13 
shows the pattern of immunoreaction which was observed when 
the 7 lysate preparations were screened with the indicated 
sera. 

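Table 13

                        Sera
Clone    | SKF | FEL | A6  | B9  | BV
8        |     |     |     |     | -
14       |     | +   | +   | +   | +
15       |     | +   | +   | +   | +
56       |     | +   | +   | +   | +
60       |     | +   | +   | +   | +
65       |     | +   | +   | +   | +
SJ26     |     |     |     |     |
5-1-1    |     |     | +   | +   |
409-1-1  |     |     | +   | +   |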

The serum samples used for screening were identified
as follows: SKF, HCV negative; FEC, HCV positive; BV,
community acquired HCV; A6 and B9 correspond to human
clinical serum samples which were HCV positive.

Immunoreactive bands identified on the Western blot
were all smaller than the expected size of 50 kd (based on
the predicted coding sequence of the cloned inserts, see
below).

Clone 15 was chosen for scale-up production of the
Sj26 fusion protein (Smith et al.). A one liter preparation
of clone 15 yielded about 200 ng of purified immunoreactive
material. The bulk of the immunoreactive material
appeared in a major doublet band which ran at approximately
29 kd. The yield from this preparation was unexpectedly
low: typically with the pGEX system a one liter protein
preparation yields in the range of 50-100 mg fusion
protein.



Example 18 

Nucleic Acid Sequences of Clones 15 and 56
The inserts of clones 15 and 56 (discussed in Example
17) were sequenced as per the manufacturer's instructions
(US Biochemical Corporation, Cleveland OH) using the
dideoxy chain termination technique (Sanger, 1979). Each
of the clones had an open reading frame contiguous with the
Sj26 reading frame of the pGEX vector. The sequences of
the clone inserts were nearly identical, with only a few minor
sequence variations: the sequence of clone 15 had a
termination codon starting at nucleotide position 126. The
sequence data for clone 56 are presented as SEQ ID NO: 11 and
in Figure 8A.

The sequencing of the inserts revealed the unusual
feature of a run of adenine residues from nucleotide
position 25 to position 34 (Figure 8A): such sequences are
similar to sequences known to promote translational frameshifting
(Wilson et al., Atkins et al.). The open reading
frame contiguous with the Sj26 coding sequence predicts a
protein of approximately 23.5 kd. Accordingly, given the
approximately 26 kd size of the Sj26 protein fragment in
this construct (Smith et al.), the complete fusion protein
would be predicted to be approximately 50 kd.
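
The size prediction above is simple addition, and can be sketched as follows. This is an illustrative calculation only; the constant and function names are hypothetical, and the two sizes are the approximate values quoted in the text.

```python
# Approximate sizes in kilodaltons (kd), as quoted in the text.
SJ26_KD = 26.0         # Sj26 protein fragment in the pGEX construct
INSERT_ORF_KD = 23.5   # protein predicted from the insert open reading frame


def fusion_size_kd(carrier_kd: float, insert_kd: float) -> float:
    """Predicted size of a carrier/insert fusion protein (simple sum)."""
    return carrier_kd + insert_kd


predicted = fusion_size_kd(SJ26_KD, INSERT_ORF_KD)
print(f"predicted fusion protein: {predicted:.1f} kd")
```

The sum, 49.5 kd, is the "approximately 50 kd" figure against which the smaller immunoreactive bands of Example 17 are compared.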

Example 19

Hydropathicity Plot of the Protein Encoded by Clone 56
The SOAP program from the IntelliGenetics PC/GENE™ software
package was used to generate the hydropathicity plot
of Figure 9. The SOAP program uses the method of Kyte et
al. to plot the hydropathicity of the protein along its sequence.
The interval used for the computation was 11 amino
acids. In Figure 9, the hydrophobic side of the plot
corresponds to the positive values range and the hydrophilic
side to the negative values range.
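
A sliding-window hydropathy calculation of this kind can be sketched as below. This is not the SOAP program itself, whose exact implementation is not reproduced here; it is a minimal moving average over the published Kyte-Doolittle scale with an 11-residue window. The function name and the toy sequence are hypothetical; the toy sequence is not the clone 56 protein.

```python
# Kyte-Doolittle hydropathy index for each amino acid (one-letter code).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=11):
    """Return (center residue number, mean hydropathy) for each window.

    Residue numbers are 1-based. Positive means are on the hydrophobic
    side of the plot, negative means on the hydrophilic side.
    """
    seq = seq.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    half = window // 2
    profile = []
    for start in range(len(seq) - window + 1):
        mean = sum(scores[start:start + window]) / window
        profile.append((start + half + 1, mean))
    return profile


# Hypothetical toy sequence: a charged stretch followed by an apolar stretch.
toy = "KKRDEKKRDEK" + "LLIVALLIVAL"
profile = hydropathy_profile(toy, window=11)
for center, mean in profile:
    print(center, round(mean, 2))
```

With the toy input the first window averages well below zero and the last well above zero, which is the sign convention described for Figure 9.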

The hydropathicity plot indicates (i) the hydrophilic
nature of the amino terminus of the capsid protein, (ii)
the relatively hydrophobic nature of the region of amino
acid residues approximately 122 to 162, and (iii) the
hydrophobic nature of amino acid residues approximately
168-182.

Further, the region of amino acid residues 168-182
demonstrates potential for being a membrane spanning
segment (Klein et al.).

Example 20

Deletion Analysis of the Clone 56 Protein Coding Region
This example describes the generation of a series of
carboxy and amino terminal deletions of the HCV capsid protein
and the effect of these deletions on the immunoreactivity
of the resulting proteins.

A. Carboxy Terminal Deletions of Clone 56.
As one step to improve the expression of the HCV
capsid protein, the putative region of translational
frameshifting was modified to reduce the probability of a
frameshift occurring in this region. In each AAA codon
encoding lysine (nucleotide positions 25 to 33, Figure 8A),
the third nucleotide (positions 27, 30 and
33, Figure 8A) was changed from A to G using standard PCR
mismatch techniques (Ausubel et al., Mullis, Mullis et
al.). The sites of these substitutions are indicated in
Figure 8A by the three G's placed over the corresponding
A's. The sequence of the modified pGEX clone was confirmed
as in Example 18 and the clone was named pGEX-CapA.
The insert sequence of clone pGEX-CapA is shown in Figure
8B and presented as SEQ ID NO: 13.

The deletion clones were generated using the PCR
primers given in Table 14. In Table 14 the BamHI site is
italicized and the termination codon is underlined.
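
The A to G substitutions described above can be illustrated with a short sketch. AAA and AAG both encode lysine, so changing the third base of each AAA codon breaks up the adenine run without changing the protein. The function names and the toy sequence are hypothetical; the toy sequence only mimics the layout described in the text (a run of adenines at positions 25 to 34) and is not the clone 56 insert.

```python
def break_lysine_run(seq, start, end):
    """Change the third base of every AAA codon to G.

    `start` and `end` are 1-based, inclusive nucleotide positions that
    must span whole codons (for example 25 to 33).
    """
    if (end - start + 1) % 3 != 0:
        raise ValueError("range must cover whole codons")
    bases = list(seq.upper())
    for pos in range(start, end + 1, 3):       # pos = first base of a codon
        codon = "".join(bases[pos - 1:pos + 2])
        if codon == "AAA":
            bases[pos + 1] = "G"               # third base, 0-based index
    return "".join(bases)


def longest_run(seq, base="A"):
    """Length of the longest uninterrupted run of `base` in `seq`."""
    best = current = 0
    for b in seq:
        current = current + 1 if b == base else 0
        best = max(best, current)
    return best


# Hypothetical 37-nt toy sequence with adenines at positions 25-34.
toy = "ATG" + "GGC" * 7 + "A" * 10 + "CCT"
modified = break_lysine_run(toy, 25, 33)
print(toy[24:34], "->", modified[24:34])
print("longest A run:", longest_run(toy), "->", longest_run(modified))
```

In the toy example the substitutions fall at positions 27, 30 and 33, matching the positions given in the text.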

Table 14

CARBOXY TERMINAL DELETION PRIMERS

1. C1      5'-CGA TCC ATG GGC ACG AAT CCT AAA CC
2. NC580   5'-G GCC GGA TCC TTA GGC CGA AGC GGG CAC AG
3. NC520   5'-G GCC GGA TCC TTA ACC AGG AAG GTT CCC TGT TGC
4. NC450   5'-G GCC GGA TCC TTA GGC CCT GGC ACG GCC TCC
5. NC360   5'-G GCC GGA TCC TTA CAA ATT GCG CGA CCT ACG CC
6. NC270   5'-G GCC GGA TCC TTA GCC CTC ATT GCC ATA GAG

Amplification reactions were carried out essentially
as described in Example 16 using primer C1 paired with each
of the NC primers and purified plasmid pGEX-CapA as
template: each cycle was 1 minute at 95°,
2 minutes annealing at 50° and 3 minutes at 72°, for 20
cycles.

The following sequence comparisons are given relative
to the nucleic acid sequence presented in Figure 8B. The
C1 primer corresponds to the common 5' end of the pGEX-CapA
insert which contains an NcoI site near the initiating
methionine. The homologous sequence of each NC primer ends at
the nucleotide position indicated: for example, the homologous
sequence of the NC580 primer ends at nucleotide position
580. A termination codon is inserted at that position,
following a BamHI site. The positions of the primers
given in Table 14 are indicated in Figure 8B. The approximate
locations of the primers relative to the protein
sequence are indicated in Figure 9.
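
The primer design described above can be checked with a short sketch. The NC primers are written 5' to 3' on the antisense strand, so the TTA triplet that follows the BamHI site (GGATCC) reads as the stop codon TAA on the coding strand. The primer sequences below are copied from Table 14; the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
BAMHI_SITE = "GGATCC"

# Four of the NC primers from Table 14, written 5' to 3'.
NC_PRIMERS = {
    "NC580": "G GCC GGA TCC TTA GGC CGA AGC GGG CAC AG",
    "NC520": "G GCC GGA TCC TTA ACC AGG AAG GTT CCC TGT TGC",
    "NC450": "G GCC GGA TCC TTA GGC CCT GGC ACG GCC TCC",
    "NC270": "G GCC GGA TCC TTA GCC CTC ATT GCC ATA GAG",
}


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def describe(primer):
    """Return (index of the BamHI site, stop codon on the coding strand)."""
    seq = primer.replace(" ", "").upper()
    site = seq.find(BAMHI_SITE)
    if site < 0:
        raise ValueError("no BamHI site in primer")
    triplet = seq[site + len(BAMHI_SITE):site + len(BAMHI_SITE) + 3]
    return site, reverse_complement(triplet)


for name, primer in NC_PRIMERS.items():
    site, stop = describe(primer)
    print(name, "BamHI at index", site, "stop codon", stop)
```

Read on the coding strand, each amplification product therefore ends with the stop codon immediately followed by the BamHI site.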

The resulting amplification products were electrophor- 
etically size fractionated on a polyacrylamide gel and the 
DNA products of the appropriate sizes electroeluted from 
the gel. The amplification products were cloned into both 
the pGEX and the pET vectors for expression. The sequences 
of the inserts were confirmed as described in Example 18. 

The pGEX vectors containing the carboxy-terminal deletions
were transformed into E. coli and the fusion proteins
purified essentially as follows. Expression of the fusion
protein was induced with IPTG for 3-4 hours. The cells
were then harvested at 6,000 rpm for 10 minutes. The E.
coli were then lysed in MTPBS buffer (150 mM NaCl; 16 mM
Na2HPO4; 4 mM NaH2PO4, pH=8.0) after which 1% "TRITON X-100,"
3 μg/ml DNase I, and 1 mM PMSF were added. The lysates were
centrifuged at 15,000 rpm for 20 minutes. The supernatants
were discarded and the pellets resuspended in 8M urea. The
components of the resuspension were separated by HPLC using
a "BIO-GEL SP-5-PW" column. Typically, the fusion protein
was the predominant peak: the location of the fusion protein
was confirmed by Western Blot analysis. Clones
C1NC270, C1NC360, and C1NC450 all expressed Sj26 fusion
proteins at high levels: the fusion proteins all corresponded
to the size predicted from the insert coding sequence
fused to the Sj26 protein and were immunoreactive
with HCV-positive sera (Western Blots were performed as
described in Example 17). Although the supernatants were
discarded, substantial amounts of the fusion proteins were
also present in the supernatants. Clones C1NC520 and
C1NC580 gave poor yields of fusion proteins.

An epitope map of the HCV capsid region is presented
in Figure 10: the locations of the immunoreactive protein
coding sequences corresponding to inserts C1NC450, C1NC360,
and C1NC270 are indicated. The sequences of C1NC450,
C1NC360, and C1NC270 are presented in the Sequence Listing
as SEQ ID NO: 15, SEQ ID NO: 17, and SEQ ID NO: 19, respectively.

B. Amino Terminal Deletions of Clone 56. 
Amino terminal deletion clones were generated using 
the PCR primers given in Table 15. 

Table 15

AMINO TERMINAL DELETION PRIMERS

1. C100   GAG CCC ATG GGT GGA GTT TAC TTG TTG CC
2. C270   GAG CCC ATG GGC TGC GGG TGG GCG GG
3. C360   GAG CCC ATG GGT AAG GTC ATC GAT ACC

Amplification reactions were carried out essentially
as described above using the primer pairs presented in
Table 16 and purified plasmid pGEX-CapA as template: each
cycle was 1 minute at 95°, 2 minutes annealing at 50°,
and 3 minutes at 72°, for 20 cycles.



Table 16

NH2 Primer   COOH Primer   Protein Produced?   Immunoreactive?
C100         NC450         LOW                 YES
C100         NC360         YES                 YES
C100         NC270         YES                 YES
C270         NC450         YES                 NO
C270         NC360         YES                 NO
C360         NC450         YES                 NO

The following sequence comparisons are given relative
to the nucleic acid sequence presented in Figure 8B where
the above described A to G substitutions have been made for
the sequence of pGEX-CapA. The NC660 primer corresponds to
the common 3' end of the pGEX-CapA insert which contains a
BamHI site near the end of the insert. The sequences of the
C primers each start at the nucleotide position indicated:
for example, the sequence of the C100 primer begins at
nucleotide position 100. Each of the C primers introduces
an in-frame initiation codon in the resulting amplification
product. The positions of the primers given in Table 15
are indicated in Figure 8B.

The resulting amplification products were cloned into
the pGEX and pET vectors for expression as described above.
The sequences of the inserts were confirmed.

The pGEX vectors containing the amino-terminal
deletions were transformed into E. coli, protein mini-lysates
prepared, and the immunoreactivity of the proteins
analyzed by Western Blots as described above. The results
of the analysis are presented in Table 16. Clones
C100NC270 and C100NC360 expressed Sj26 fusion proteins at
high levels: the fusion proteins corresponded to the size
predicted from the insert coding sequence fused to the Sj26
protein.

An epitope map of the HCV capsid region is presented
in Figure 10: the locations of the protein coding sequences
corresponding to inserts C100NC270, C100NC360, C270NC360,
and C270NC450 are indicated. The sequences for C100NC270
and C100NC360 are presented in the Sequence Listing as SEQ
ID NO: 21 and SEQ ID NO: 23, respectively.
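
One way to read the deletion results is to intersect the intervals of the immunoreactive constructs. The sketch below does this under loudly stated assumptions: that the primer names give approximate nucleotide endpoints, that the C1 primer starts at nucleotide 1, and that reactivity depends on a single contiguous region. None of these assumptions is stated in the text, and Figure 10 remains the authoritative epitope map. The function names are hypothetical.

```python
# (clone, assumed first nt, assumed last nt, immunoreactive?) taken from the
# carboxy-terminal deletion results and from Table 16.
CONSTRUCTS = [
    ("C1NC270", 1, 270, True),
    ("C1NC360", 1, 360, True),
    ("C1NC450", 1, 450, True),
    ("C100NC270", 100, 270, True),
    ("C100NC360", 100, 360, True),
    ("C100NC450", 100, 450, True),
    ("C270NC360", 270, 360, False),
    ("C270NC450", 270, 450, False),
    ("C360NC450", 360, 450, False),
]


def common_reactive_region(constructs):
    """Intersection of all immunoreactive intervals, or None if empty."""
    reactive = [(s, e) for _, s, e, ok in constructs if ok]
    start = max(s for s, _ in reactive)
    end = min(e for _, e in reactive)
    return (start, end) if start <= end else None


def is_consistent(constructs, region):
    """True if no non-reactive construct fully contains the region."""
    start, end = region
    return all(not (s <= start and e >= end)
               for _, s, e, ok in constructs if not ok)


region = common_reactive_region(CONSTRUCTS)
print("shared region (nt):", region)
print("consistent with negative clones:", is_consistent(CONSTRUCTS, region))
```

Under these assumptions every immunoreactive construct shares roughly nucleotides 100 to 270, and none of the non-reactive constructs spans that stretch.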

Example 21 

Expanded Immunoscreening Using the Capsid Antigen
This example describes three different comparisons of
the immunoreactivity of the various HCV antigens of the
present invention to several batteries of sera.
A. Effectiveness of Cap450 Antigen.
Table 17 shows the results for 50 human sera samples
from patients suspected of NANB hepatitis infection. The
ELISA assays were performed essentially as described by
Tijssen using the following antigens: C100, 409-1-1 (c-a),
C33u, Cap450 (the protein product of the pGEX-C1NC450
clone), and with 409-1-1 (c-a) and Cap450 in one well, which
was optimized to give the most sensitive results. These
ELISA data were compared with the Abbott C100 test.
Patient serum was scored positive for Sj26 fusion proteins
(409-1-1 (c-a), 33u, 5-1-1, and Cap450) if the absorbance
was three times the absorbance of that serum on Sj26
native protein. A sample was scored positive on pET antigens
(Cap360) if the absorbance was three times the mean of
the absorbance of negative control sera. A patient serum
was scored positive on the combined 409-1-1 (c-a)/Cap450 assay
if the absorbance was equivalent to or greater than that of
control positive sera. Samples within 10% of the control
positive sera were scored weak positives.
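
The scoring rules above can be sketched as three small functions. This is an interpretation rather than the authors' procedure: "three times" is read as "at least three times", and "within 10%" is read as "no more than 10% below the positive control". The function names and the absorbance values in the usage lines are hypothetical.

```python
def score_sj26_fusion(abs_antigen, abs_sj26):
    """Positive if absorbance on the fusion antigen is at least three
    times the absorbance of the same serum on native Sj26."""
    return abs_antigen >= 3 * abs_sj26


def score_pet_antigen(abs_antigen, negative_controls):
    """Positive if absorbance is at least three times the mean
    absorbance of the negative control sera."""
    mean_negative = sum(negative_controls) / len(negative_controls)
    return abs_antigen >= 3 * mean_negative


def score_combined(abs_sample, abs_positive_control, tolerance=0.10):
    """Score the combined-well assay as positive, weak positive or negative."""
    if abs_sample >= abs_positive_control:
        return "positive"
    if abs_sample >= (1 - tolerance) * abs_positive_control:
        return "weak positive"
    return "negative"


# Hypothetical absorbance readings, for illustration only.
print(score_sj26_fusion(0.90, 0.20))
print(score_pet_antigen(0.45, [0.10, 0.12, 0.08]))
print(score_combined(0.75, 0.80))
```

Keeping the three rules separate mirrors the text, where the reference value differs by antigen type: native Sj26 for the fusion proteins, negative control sera for pET antigens, and positive control sera for the combined well.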

Samples 1-19: Chronic active hepatitis proven by biopsy.
HBsAg(-).

Samples 20-44: Acute viral hepatitis. HBsAg(-), IgM anti-HBc(-),
IgM anti-HAV(-).

Samples 45-50: Chronic active hepatitis proven by biopsy.
HBsAg(-).



Table 17 
Korean Panel II 



30 



35 





Sample 
# 


C100 


409-1-1 
(c-a) 


C33u 


Cap 
450 


Combined 
409-1-1 (c-a) 
+CAP450 








+ 


+ 


+ 


+ 


2 




+ 


+ 


+ 


+ 


+ 


3 




+ 


+ 


+ 


+ 


+ 


4 




+ 


+ 


+ 


+ 


+ 


5 




+ j 


+ 


+ 


+ 


+ 


6 






+ 


+ 


+ 


+ 


7 




+ 


+ 


+ 


+ 


+ 


8 




+ 


+ 


+ 


+ 


+ 
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9 




+ 


— 


+ 


+ 


+ 


10 




+ 




+ 


* 


+ 


11 




+ 


+ 


+ 




+ 


12 




+ 


+ 


+ 


+ 


+ 


13 




+ 


+ 


+ 


+ 


+ 


14 




+ 


+ 


+ 


+ 


+ 


15 




+ 


+ 


+ 


+ 


+ 


16 




+ 


+ 


+ 


* 


+ 


17 




+ 


+ 


+ 


+ 


+ 


18 




+ 


+ 


+ 


4- 


i 


19 




+ 










20 


945 












21 


988 


+ 


+ 


+ 


* ! 


-+ 


22 


3383 


+ 










23 


4072 












24 


4242 












25 


4490 













20 



* = positive (low) 
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15 





Sample 
# 


C100 


409-1-1 
1 (c-aj 


CJ JU 


Cap 
450 


Combined 
409-1-1 <c-a 
+CAP450 


£• D 


AR 1 A 










— 


27 


r t o ^ 

5322 








— 


— 


zo 


odUJ 










— 














— 


30 


9033 










— 




Q7CQ 
7 / DO 








— 


— 


•3 z 


7 / /O 










— 


J J 


10197 


+ 






+ 






10200 










— 


Jo 


10409 










— 


JO 


10811 








— 


— 


37 


11209 




+ 


+ 


+ 


ND 


38 


12245 










— 


39 


12143 






— 


— 


— 














— 


41 














AO 














43 


14188 












44 


13437 












45 


863 












46 


3354 












47 


12640 




+ 


+ 


+ 


+ 


48 


13095 




* 


+ 




w+ 


49 


14501 












50 


14345 


+ J 


+ 






+ 



* = positive (low) 



The results demonstrate that the Cap450 protein has
good sensitivity for detecting the presence of anti-HCV antibodies
in sera samples. Three additional samples (6, 37,
and 47) were detected. Further, these results indicate
that the combination of Cap450 and 409-1-1 (c-a) can be used
to produce a kit which is very effective for detection of
anti-HCV antibodies in human sera samples.

B. Cap450 and Cap360. 

The results in Table 18 demonstrate the effectiveness
of the Cap450 and Cap360 antigens (the latter being the protein
product encoded by pET-C1NC360) to detect HCV antibodies present
in human sera. The samples were tested for the presence of
HCV by ELISA using each individual antigen shown, or with
409-1-1 (c-a) and Cap450 antigens combined in one well.

Table 18 



15 



20 



25 



SERUM 


PATIENT DIAGNOSIS 


C100 


ELISA 
5-1-1 


409-1-1 

1 w a j 




Cap 


Combined 
(409-1-1) 


G-131 


Acute Hepatitis; Pt "CO." 














G-132 


Acute Hepatitis; Pt "CO," 




— 


— 


— 


- 


- 


G-143 


Acute Hepatitis; Pt "CO." 














G-285 


Acute Hepatitis; Pt "CO." 


ND 


ND 


ND 


ND 


ND 




G-150 


Acute P.T. Hepatitis; Pt 
"G.L." 






I 


I 


+ 


+ 


G-151 


Acute P.T. Hepatitis; Pt 
"G.L. n 






I 




+ 


+ 


G-152 


Acute P.T. Hepatitis; Pt 
-G.L." 










+ 


+ 


G-153 


Acute P.T. Hepatitis; Pt 
W G.L. M 






I 




+ 


+ 


G-286 


Acute P.T. Hepatitis; Pt 
n G.L. " 


ND 


ND 


ND 


ND 


ND 


+ 


G-43 


Fulminant Liver Disease 














G-l 


Community Acquired 
Hepatitis 


ND 


I 




+ 


+ 


+ 


G-109 


Community Acquired 
Hepatitis 


+ 




+ 


+ 


+ 


+ 


G-114 


Community Acquired 
Hepatitis 


ND 












G-128 


Community Acquired 
Hepatitis 


+ 




I 


+ 


+ 


+ 


G-3 


Community Acquired 
aepatitis 
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SERU2 


A PATIENT DIAGNOSIS 


C10I 


elisj 

D 5-1-J 


k 409-1-. 
. (c-a) 


l 

C33i 


Caj 
i 36C 


Combined 
) (409-1-1) 
) +Cap450 


G-126 


> Community Acquired 
Hepatitis 












+ 


0-12*3 


' Community Acquired 
Hepatitis 


+ 


I 


+ 


+ 






G-42 


Idiopath. Comm. Ac. 
Hepatitis 














G-51 


Community Acquired 
Hepatitis B 






+ 




+ 


+ 


G-27 


Community Acquired 
Hepatitis B 






1 1 








G-22" 


Community Acquired 
Hepatitis B 














G-40 


Community Acquired 
Hepatitis B 














G-31 


Community Acquired 
Hepatitis B 


+ 




+ 




+ 


+ 


G-45 


Community Acquired 
Hepatitis B 


— 


— 


- 


- 


- 


- 


G-38 


Fulminant Hepatitis B 


— 


— 


- 


- 


- 


- 


G-41 


Community Acquired 
Hepatitis C 






I 


+ 


I 


+ 


G-13 


Hepatitis C 


+ 


I 


+ 


+ 




+ 


G-12 


Hepatitis C 


+ 






+ 


+ 


+ 


G-6 


Hepatitis C 














G-49 


BtOH Cirrhosis 


_ 












G-25 


StOH Cirrhosis 














G-110 


StOH Cirrhosis 


_ 




+ 


+ 


I 


+ 


G-46 I 


StOH Cirrhosis 














G-272 ] 


[nfant Liver Transplant 


ND 


- 


- 


- 




- 


G-274 ] 


Infant Liver Transplant 


ND 




+ 


+ 






G-16 I 


>BC 






+ 


+ 






G-123 I 


NC LT 














G-122 I 


NC LT 




+ 










G-125 N 


o Diagnosis 




I 


+ ! 


+ 


I 


+ 


G-124 N 


o Diagnosis 








+ 


+ 


+ 



These results indicate that the combination of antigen
409-1-1 (c-a) and Cap360 or Cap450 results in an effective
diagnostic tool for detection of HCV infection. Five
additional samples (G150, G151, G110, G125, and G124) were
detected with these ELISAs compared with the C100 test alone.
C. pET360 

The results in Table 19 demonstrate the effectiveness 
of the pET360 to detect HCV antibodies present in human 
sera. The samples were tested for the presence of HCV by 
ELISA using each individual antigen shown, or with 409-1-1 
(c-a) and pET360 antigens combined in one well. 

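Table 19

Sample | C100 | 5-1-1 | 409-1-1 (c-a) | C33u | pET360 | Combined 409-1-1 (c-a) + pET360
A      | +    |       | +             | +    | -      | +
B      | +    | +     | +             | +    |        | +
C      | +    |       |               | +    |        | +
D      | +    |       | +             |      | +      | +
E      | +    |       | w+            | +    | +      | +
F      |      |       |               |      |        |
G      | +    |       | w+            | +    | +      | +
H      |      |       | +             | +    | +      | +
I      |      |       |               |      |        |
J      |      |       |               |      |        |
K      |      |       |               | +    | +      | +
L      |      |       |               |      |        |
M      |      |       |               |      |        |
N      |      | w+    |               | +    | +      | +
O      | +    | w+    | +             | +    | +      | +
P      | +    | w+    | +             | +    | +      | +
Q      |      |       |               |      |        |
R      |      |       |               |      |        |
S      |      |       |               |      |        |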

These results indicate that the combination of antigens 409-1-1
(c-a) and pET360 results in an effective diagnostic tool for detection of
HCV infection. Three additional samples were detected with these
ELISAs compared with the C100 test alone.

Although the invention has been described with reference to
particular embodiments, methods, construction and use, it will be
apparent to those skilled in the art that various changes and modifications
can be made without departing from the invention.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: Reyes, Gregory 
Kim, Jungsuh P. 
Moeckli, Randolph 
Simonsen, Christian C. 

(ii) TITLE OF INVENTION: Hepatitis C Virus Epitopes 

(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Peter J. Dehlinger 

(B) STREET: 350 Cambridge Ave., Suite 100 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94306 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 505,611

(B) FILING DATE: 06-APR-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 594,854 

(B) FILING DATE: 09-OCT-1990 

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Fabian, Gary R.

(B) REGISTRATION NUMBER: 33,875

(C) REFERENCE/DOCKET NUMBER: 4600-076.21

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 323-8302 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 304-12-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..561 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GAA TTC CTC GTG CAA GCG TGG AAG TCC AAG AAA ACC CCA ATG GGG TTC 48
Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe
1 5 10 15

TCG TAT GAT ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC GAC ATC 96
Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp Ile
20 25 30



WO 91/15516 



PCT/US91/02370 



94 

CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC 144
Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln Ala
35 40 45

CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT 192
Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro
50 55 60

CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG 240
Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala
65 70 75 80

AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 288
Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
85 90 95

AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG 336
Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr Met
100 105 110

CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC 384
Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly Val
115 120 125

CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 432
Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg
130 135 140

TAC TCC GCC CCC CCC GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG 480
Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu Glu
145 150 155 160

CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT 528
Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala
165 170 175

GGA AAG AGG GTC TAC TAC CTC ACC CGG GAA TTC 561
Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe
180 185

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe
1 5 10 15

Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp Ile
20 25 30

Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln Ala
35 40 45

Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro
50 55 60

Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala
65 70 75 80

Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
85 90 95

Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr Met
100 105 110

Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly Val
115 120 125

Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg
130 135 140

Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu Glu
145 150 155 160

Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala
165 170 175

Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe
180 185

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 303-1-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..252 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA CCA ATA GAC 48
Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro Ile Asp
1 5 10 15

ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT CAG CCT GAG AAG 96
Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys
20 25 30

GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG GGC GTG 144
Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val
35 40 45

CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT ACC AAG CTC CCC 192
Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro
50 55 60

TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA CAG 240
Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln
65 70 75 80

CGG GTT GAA TTC 252
Arg Val Glu Phe


(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro Ile Asp
1 5 10 15

Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys
20 25 30

Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val
35 40 45

Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro
50 55 60

Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln
65 70 75 80

Arg Val Glu Phe



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 303-1-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1512 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAA TTC TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC 48
Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro
1 5 10 15

CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA TCA TTC AGA GTA GGA CTC 96
Pro Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu
20 25 30

CAC GAA TAC CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAT 144
His Glu Tyr Pro Val Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp
35 40 45

GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT ATA ACA GCA 192
Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala
50 55 60

GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC 240
Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala
65 70 75 80

AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC AAG GCA ACT TGC 288
Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys
85 90 95

ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG CTC ATA GAG GCC AAC CTC 336
Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu
100 105 110

CTA TGG AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT GAG TCA GAA 384
Leu Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu
115 120 125

AAC AAA GTG GTG ATT CTG GAC TCC TTC GAT CCG CTT GTG GCG GAG GAG 432
Asn Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu
130 135 140

GAC GAG CGG GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG TCT CGG 480
Asp Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg
145 150 155 160

AGA TTC GCC CAG GCC CTG CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC 528
Arg Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro
165 170 175

CCG CTA GTG GAG ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC 576
Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val
180 185 190

CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT 624
His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro
195 200 205

CGG AAG AAG CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 672
Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala
210 215 220

TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC 720
Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly
225 230 235 240

ATT ACG GGC GAC AAT ACG ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC 768
Ile Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly
245 250 255

TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC CTG 816
Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu
260 265 270

GAG GGG GAG CCT GGG GAT CCG GAT CTT AGC GAC GGG TCA TGG TCA ACG 864
Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr
275 280 285

GTC AGT AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC TCA ATG TCT 912
Val Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser
290 295 300

TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC GCG GAA GAA CAG 960
Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln
305 310 315 320

AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC CAC AAT 1008
Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn
325 330 335

TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT TGC CAA AGG CAG AAG AAA 1056
Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys
340 345 350

GTC ACA TTT GAC AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC GTA 1104
Val Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val
355 360 365

CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA GTG AAG GCT AAC TTG CTA 1152
Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu
370 375 380

TCC GTA GAG GAA GCT TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC 1200
Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
385 390 395 400

AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC 1248
Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala
405 410 415

GTA ACC CAC ATC AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA 1296
Val Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val
420 425 430

ACA CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT 1344
Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val
435 440 445

CAG CCT GAG AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC 1392 
Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro
450 455 460

GAT CTG GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT 1440
Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
465 470 475 480

ACC AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC 1488 
Thr Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr
485 490 495

TCA CCA GGA CAG CGG GTT GAA TTC 1512
Ser Pro Gly Gln Arg Val Glu Phe
500



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro
1 5 10 15

Pro Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu
20 25 30

His Glu Tyr Pro Val Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp
35 40 45

Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala
50 55 60

Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala
65 70 75 80

Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys
85 90 95

Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu
100 105 110

Leu Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu
115 120 125

Asn Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu
130 135 140

Asp Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg
145 150 155 160

Arg Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro
165 170 175

Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val
180 185 190

His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro
195 200 205

Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala
210 215 220

Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly
225 230 235 240

Ile Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly
245 250 255



Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu
260 265 270

Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr
275 280 285

Val Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser
290 295 300

Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln
305 310 315 320

Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn
325 330 335

Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys
340 345 350

Val Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val
355 360 365

Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu
370 375 380

Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
385 390 395 400

Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala
405 410 415

Val Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val
420 425 430

Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val
435 440 445

Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro
450 455 460

Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
465 470 475 480

Thr Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr
485 490 495

Ser Pro Gly Gln Arg Val Glu Phe
500

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(C) INDIVIDUAL ISOLATE: Rodney 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 409-1-1 (c-a) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAA TTC CGC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGG GCG TAC ATG 48
Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
1 5 10 15

AAC ACT CCG GGG CTT CCC GTG TGC CAG GAC GGA ATT CCG TCC CCG TCC 96
Asn Thr Pro Gly Leu Pro Val Cys Gln Asp Gly Ile Pro Ser Pro Ser
20 25 30

ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA GTA 144
Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
35 40 45

ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG TGC 192
Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
50 55 60

GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG GCC 240
Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
65 70 75 80

TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT GTT 288
Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
85 90 95

GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC 336
Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
100 105 110

GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT TTC 384
Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
115 120 125

AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG GAT 432
Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
130 135 140

GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC ACG GAA TTC 477
Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Thr Glu Phe
145 150 155



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
1 5 10 15

Asn Thr Pro Gly Leu Pro Val Cys Gln Asp Gly Ile Pro Ser Pro Ser
20 25 30

Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
35 40 45

Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
50 55 60

Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
65 70 75 80

Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
85 90 95

Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
100 105 110

Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
115 120 125

Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
130 135 140

Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Thr Glu Phe
145 150 155
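
The correspondence between a nucleotide listing and its paired protein listing can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and using only the first 48-base row of SEQ ID NO: 7 as input; the names `translate`, `CODON_TO_AA`, `THREE_LETTER`, and `FIRST_ROW` are illustrative and are not part of the listing.

```python
from itertools import product

# Standard genetic code (an assumption of this sketch), with codons
# enumerated in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate(cds: str) -> str:
    """Translate a coding sequence (whitespace ignored) codon by codon."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return " ".join(THREE_LETTER[CODON_TO_AA[c]] for c in codons)


# First row of SEQ ID NO: 7 as printed in the listing.
FIRST_ROW = "GAA TTC CGC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGG GCG TAC ATG"
print(translate(FIRST_ROW))
```

The printed residues can be compared by eye with the first row of SEQ ID NO: 8; the same function can be applied to any other row of a nucleotide listing.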



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 409-1-1 (abc) 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..558 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TCC ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA 48
Ser Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu
1 5 10 15

GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG 96
Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys
20 25 30

TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG 144
Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val
35 40 45

GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT 192
Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp
50 55 60

GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC 240
Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp
65 70 75 80

TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 288
Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
85 90 95

TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG 336
Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln
100 105 110

GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG 384
Asp Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys
115 120 125

CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC CCC TCC GGC ATG 432
Pro Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met
130 135 140

TTC GAC TCG TCC GTC CTC TGT GAG TGC TAT GAC GCA GGC TGT GCT TGG 480
Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp
145 150 155 160

TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG 528
Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
165 170 175

AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC 558
Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
180 185



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu
1 5 10 15

Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys
20 25 30

Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val
35 40 45

Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp
50 55 60

Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp
65 70 75 80

Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
85 90 95

Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln
100 105 110

Asp Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys
115 120 125

Pro Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met
130 135 140

Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp
145 150 155 160

Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
165 170 175

Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
180 185

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: GG1 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..657 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATG GGC ACG AAT CCT AAA CCT CAA AAA AAA AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140



GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC 480
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 528
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 576
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 624
Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 657
Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC

(vii) IMMEDIATE SOURCE: 
(B) CLONE: CapA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC 480
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 528
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 576
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 624
Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205



AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 657
Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC450 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGA GGC GCT GCC AGG GCC TA 453
Gly Gly Ala Ala Arg Ala
145 150



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala
145 150



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC360 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..360 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG TA 360
Arg Arg Arg Ser Arg Asn Leu
115 120



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu
115



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE:

(B) CLONE: C1NC270

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..273 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 273
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
85 90



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
85 90



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C100NC270 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..183

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 48
Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 96
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 144
Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 183
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
50 55 60



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
50 55 60

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C100NC360 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 48
Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 96
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 144
Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG 192
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala
50 55 60

GGA TGG CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 240
Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr
65 70 75 80

GAC CCC CGG CGT AGG TCG CGC AAT TTG TA 270
Asp Pro Arg Arg Arg Ser Arg Asn Leu
85 90



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala
50 55 60

Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr
65 70 75 80

Asp Pro Arg Arg Arg Ser Arg Asn Leu
85

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC105 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..106 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TTA A 106
Gly Val Leu
35



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Leu
35
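
The LENGTH fields of the paired nucleotide and protein listings above can be cross-checked arithmetically. The sketch below is illustrative only: the tuples are copied from the LENGTH fields of SEQ ID NOs: 7 through 26, while the names `PAIRS` and `check` are hypothetical. Reading one surplus codon as a terminator that is not counted in the protein listing is an assumption of this sketch, not a statement made in the listing.

```python
# (nucleotide SEQ ID NO, length in base pairs,
#  protein SEQ ID NO, length in amino acids), from the LENGTH fields.
PAIRS = [
    (7, 477, 8, 159),
    (9, 558, 10, 186),
    (11, 657, 12, 219),
    (13, 657, 14, 219),
    (15, 453, 16, 150),
    (17, 360, 18, 119),
    (19, 273, 20, 90),
    (21, 183, 22, 60),
    (23, 270, 24, 89),
    (25, 106, 26, 35),
]


def check(base_pairs: int, residues: int) -> tuple[int, int]:
    """Return (codons beyond the listed residues, bases left over)."""
    codons, leftover = divmod(base_pairs, 3)
    return codons - residues, leftover


for nt_id, bp, aa_id, residues in PAIRS:
    extra, leftover = check(bp, residues)
    print(f"SEQ ID NO: {nt_id} -> {aa_id}: "
          f"{extra} extra codon(s), {leftover} leftover base(s)")
```

Under these inputs, every pair differs by at most one codon or one base, which gives a quick way to spot a transcription error in either LENGTH field.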

APPENDIX

[Nucleotide sequence figure spanning several sheets, including sheet 4 of 8, with numbered positions from 961 onward; the sequence text is not legible in this copy.]

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IT IS CLAIMED: 

1. A recombinant polypeptide which is immunoreactive 
with sera from humans infected with hepatitis C virus 
(HCV) . 

2. The polypeptide of claim 1, which includes an 
immunoreactive portion of an HCV polypeptide which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4. 



3. The polypeptide of claim 2, which includes the 
entire 504 residue peptide. 

4. The polypeptide of claim 3, whose carboxy- 
terminal amino acid sequence is encoded by the polynucleo- 
tide sequence presented as SEQ ID NO: 3. 

5. The polypeptide of claim 3, which is produced by 
the expression vector contained in an Escherichia coli host 
identified by ATCC No. 40901. 

6. The polypeptide of claim 1, which includes an 
immunoreactive portion of the peptide sequence presented as 
SEQ ID NO: 2. 

7. The polypeptide of claim 6, which includes the 
specified sequence. 



8. The polypeptide of claim 7, which is encoded by 
the polynucleotide sequence presented as SEQ ID NO: 1. 
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9. The polypeptide of claim 7, which is produced by 
the expression vector contained in an Escherichia coli host 
identified by ATCC No. 40893. 

10. The polypeptide of claim 1, which includes the 
peptide sequence presented as SEQ ID NO: 8. 



11. The polypeptide of claim 10, which is encoded by 
the polynucleotide sequence presented as SEQ ID NO: 7. 


12. The polypeptide of claim 1, which includes the 
peptide sequence presented as SEQ ID NO: 10. 

13. The polypeptide of claim 12, which is encoded by 
the polynucleotide sequence presented as SEQ ID NO: 9. 

14. The polypeptide of claim 1, which includes a 
peptide sequence selected from the group consisting of the 
peptide sequences presented as SEQ ID NO: 12, SEQ ID NO: 14, 
SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ 
ID NO:24, and SEQ ID NO:26. 



15. The polypeptide of claim 1, which is encoded by 
a polynucleotide sequence selected from the group of poly- 
nucleotide sequences presented as SEQ ID NO: 11, SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID 
NO:21, SEQ ID NO:23, and SEQ ID NO:25. 

16. A diagnostic kit for use in screening human blood 
containing antibodies specific against hepatitis C virus 
(HCV) infection comprising 

at least one peptide antigen which is immunoreactive 
with sera from humans infected with hepatitis C virus 
(HCV), and 

means for detecting the binding of said antibodies to 
the antigen. 
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17. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of an HCV polypeptide sequence 
which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO:4. 

18. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of the peptide sequence presented 
as SEQ ID NO: 2. 



19. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of the peptide sequence presented 
as SEQ ID NO: 8. 

20. The kit of claim 19, which further contains a 
second antigen, wherein the second antigen includes an 
immunoreactive portion of a peptide sequence selected from 
the group consisting of the peptide sequences presented as 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ 
ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID NO:26. 



21. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of the peptide sequence presented 
as SEQ ID NO: 10. 

22. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of a peptide sequence selected 
from the group consisting of the peptide sequences presented 
as SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID 
NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID 
NO:26. 

23. The kit of claim 16, wherein said detecting means 
includes a solid support to which said peptide is attached 
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and a reporter-labeled anti-human antibody, wherein binding 
of said serum antibodies to said antigen can be detected by 
binding of the reporter-labeled antibody to said solid 
surface. 

24. A method of detecting hepatitis C virus (HCV) 
infection in an individual comprising 

reacting serum from an HCV-infected test individual 
with at least one peptide antigen which is immunoreactive 
with sera from humans infected with hepatitis C virus 
(HCV), and 

examining the antigen for the presence of bound 
antibody. 

25. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of an HCV 
peptide which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4. 






26. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 2. 



27. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 8. 


28. The method of claim 24, which further contains a 
second antigen, wherein the second antigen includes an 
immunoreactive portion of a peptide sequence selected from 
the group consisting of the peptide sequences presented as 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ 
ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID NO:26. 
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29. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 10. 

30. The method of claim 24, wherein the antigen 
includes an immunoreactive portion of a peptide sequence 
selected from the group consisting of the peptide sequences 
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ 
ID NO: 26. 

31. The method of claim 24, wherein the peptide 
antigen is attached to a solid support, said reacting 
includes reacting the peptide antigen with the support, and 
subsequently reacting the support with a reporter-labeled 
anti-human antibody, and said examining includes detecting 
the presence of reporter-labeled antibody on the solid 
support. 



32. A method of producing a polypeptide which is 
immunoreactive with sera from humans infected with hepatitis 
C virus (HCV), comprising 

introducing into a suitable host, a recombinant 
expression system containing an open reading frame (ORF) 
having a polynucleotide sequence which encodes a polypeptide 
which is immunoreactive with sera from humans infected 
with hepatitis C virus (HCV), where the vector is designed 
to express the ORF in said host, and 

culturing said host under conditions resulting in the 
expression of the ORF sequence. 

33. The method of claim 32, wherein the polypeptide 
includes an immunoreactive portion of an HCV polypeptide 
which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 
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c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded by the polynucleotide sequence 
presented as SEQ ID NO: 3. 

34. The method of claim 33, wherein the expression 
vector is a lambda gt11 phage vector, the host is E. coli, 
and the host containing the introduced vector is identified 
by ATCC No. 40901. 

35. The method of claim 32, wherein the polypeptide 
has the sequence presented as SEQ ID NO: 2, and the 
polynucleotide has the sequence presented in SEQ ID NO: 1. 

36. The method of claim 35, wherein the expression 
vector is a lambda gt11 phage vector, the host is E. coli, 
and the host containing the introduced vector is identified 
by ATCC No. 40893. 


37. The method of claim 32, wherein the polypeptide 
has the sequence presented as SEQ ID NO: 8, and the polynu- 
cleotide has the sequence presented as SEQ ID NO: 7. 

38. The method of claim 37, wherein the expression 
vector is a lambda gt11 phage vector, the host is E. coli, 
and the host containing the introduced vector is identified 
by ATCC No. 40792. 

39. The method of claim 32, wherein the polypeptide 
has the sequence presented as SEQ ID NO: 10, and the 
polynucleotide has the sequence presented as SEQ ID NO: 9. 
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40. The method of claim 39, wherein the expression 
vector is a lambda gt11 phage vector, the host is E. coli, 
and the host containing the introduced vector is identified 
by ATCC No. 40876. 

41. The method of claim 32, wherein the polypeptide 
includes an immunoreactive portion of a peptide sequence 
selected from the group consisting of the peptide sequences 
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ 
ID NO: 26. 

42. The method of claim 41, wherein the expression 
vector is a pGEX or pET vector and the host is E. coli. 

43. An expression system for expressing a recombinant 
peptide antigen which is immunoreactive with sera from 
humans infected with hepatitis C virus (HCV), comprising 

a host capable of supporting expression of an open 
reading frame in a selected expression vector, and 

the selected expression vector containing an open 
reading frame (ORF) having a polynucleotide sequence which 
encodes said peptide antigen. 

44. The expression system of claim 43, wherein the 
peptide antigen includes an immunoreactive portion of an 
HCV polypeptide sequence which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded by the polynucleotide sequence 
presented as SEQ ID NO: 3. 

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45. The expression system of claim 44, wherein the 
expression vector is a lambda gt11 phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40901. 

46. The expression system of claim 43, wherein the 
peptide antigen has the sequence presented as SEQ ID NO: 2, 
and the polynucleotide has the sequence presented as SEQ ID 
NO: 1. 

47. The expression system of claim 46, wherein the 
expression vector is a lambda gt11 phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40893. 

48. The expression system of claim 43, wherein the peptide 
antigen has the sequence presented as SEQ 
ID NO: 8, and the polynucleotide has the sequence presented 
as SEQ ID NO: 7. 

49. The expression system of claim 48, wherein the 
expression vector is a lambda gt11 phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40792. 

50. The expression system of claim 43, wherein the peptide 
antigen has the sequence presented as SEQ 
ID NO: 10, and the polynucleotide has the sequence presented 
as SEQ ID NO: 9. 

51. The expression system of claim 50, wherein the 
expression vector is a lambda gt11 phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40876. 



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52. The expression system of claim 43, wherein the 
polypeptide includes an immunoreactive portion of a peptide 
sequence selected from the group consisting of the peptide 
sequences presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24, and SEQ ID NO:26. 

53. The expression system of claim 52, wherein the 
expression vector is a pGEX or pET vector and the host is 
E. coli. 

54. A polynucleotide which encodes a polypeptide 
which is immunoreactive with sera from humans infected with 
hepatitis C virus (HCV). 

55. The polynucleotide of claim 54, wherein the 
polypeptide includes an immunoreactive portion of a peptide 
sequence which: 

a) is encoded by an HCV coding sequence; 
b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded by the polynucleotide sequence 
presented as SEQ ID NO: 3. 

56. The polynucleotide of claim 54, which includes 
the polynucleotide sequence presented as SEQ ID NO: 1. 



57. The polynucleotide of claim 54, which includes 
the polynucleotide sequence presented as SEQ ID NO: 9. 



58. The polynucleotide of claim 54, which includes a 
polynucleotide sequence selected from the group of polynucleotide 
sequences presented as SEQ ID NO: 11, SEQ ID NO: 13, 
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SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ 
ID NO: 23, and SEQ ID NO: 25. 
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GAA TTC CTC GTG CAA GCG TGG AAG TCC AAG AAA ACC CCA ATG GGG TTC 
TCG TAT GAT ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC GAC ATC 
CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC 
CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT 
CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG 
AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG 
CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC 
CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 
TAC TCC GCC CCC CCC GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG 
CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT 
GGA AAG AGG GTC TAC TAC CTC ACC CGG 

Fig. 3 
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GAA TTC TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC 
CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA TCA TTC AGA GTA GGA CTC 
CAC GAA TAC CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAT 
GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT ATA ACA GCA 
GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC 
AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC AAG GCA ACT TGC 
ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG CTC ATA GAG GCC AAC CTC 
CTA TGG AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT GAG TCA GAA 
AAC AAA GTG GTG ATT CTG GAC TCC TTC GAT CCG CTT GTG GCG GAG GAG 
GAC GAG CGG GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG TCT CGG 
AGA TTC GCC CAG GCC CTG CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC 
CCG CTA GTG GAG ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC 
CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT 
CGG AAG AAG CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 
TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC 
ATT ACG GGC GAC AAT ACG ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC 
TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC CTG 
GAG GGG GAG CCT GGG GAT CCG GAT CTT AGC GAC GGG TCA TGG TCA ACG 
GTC AGT AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC TCA ATG TCT 
TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC GCG GAA GAA CAG 
AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC CAC AAT 
TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT TGC CAA AGG CAG AAG AAA 
GTC ACA TTT GAC AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC GTA 
CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA GTG AAG GCT AAC TTG CTA 
TCC GTA GAG GAA GCT TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC 
AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC 
GTA ACC CAC ATC AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA 
ACA CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT 
CAG CCT GAG AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC 
GAT CTG GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT 
ACC AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC 
TCA CCA GGA CAG CGG GTT GAA TTC 

Fig. 4 
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■ STTGEIPFYGKAIPLE 
CC ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA 

VIKGGRHLIFCHSKKK 
GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG 

CDELAAKLVALGINAV 
TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG 

AYYRGLDVSVIPTSGD 
GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT 

VVVVATDALMTGYTGD 
GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC 

FDSVIDCNTCVTQTVD 
TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 

FSLDPTFTIETITLPQ 
TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG 

DAVSRTQRRGRTGRGK 

GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG 

PGIYRFVAPGERPSGM 
CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC CCC TCC GGC ATG 

FDSSVLCECYDAGCAW 
TTC GAC TCG TCC GTC CTC TGT GAG TGC TAT GAC GCA GGC TGT GCT TGG 

YELTPAETTVRLRAYM 
TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG 

NTPGLPVCQD* 
AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC 

Fig. 5 
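The pairing of each codon row with its one-letter amino acid row in Fig. 5 can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the specification. A leading partial codon, such as the "CC" that opens the first row, is skipped rather than guessed.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering
# of first, second, and third codon positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codons: str) -> str:
    """Translate whitespace-separated codons to one-letter amino acids.

    Tokens that are not exactly three bases long (partial codons at the
    edge of a cloned fragment) are skipped.
    """
    return "".join(
        CODON_TABLE[codon]
        for codon in codons.upper().split()
        if len(codon) == 3
    )


# Second codon row of Fig. 5, copied from the figure.
row = "GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG"
print(translate(row))  # VIKGGRHLIFCHSKKK
```

Running the same function over the other rows is a quick way to spot scanning errors in either the codon line or the amino acid line of a figure.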
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Epitope Comparison/Delineation 

Position labels on the map: 225, 2754, 3024, 3311 

Clone 33C (799bp) 
Clone 33C-U (529bp) 
409.1.1 (557bp) 
409.1.1 (c+270) (340bp) 

Fig. 7 

Position label on the map: 3129 

409.1.1 F1R2 (270bp) 
409.1.1 (a) (375bp) 
409.1.1 (ca) (445bp) 
409.1.1 (n) (105bp) 
409.1.1 (C) (70bp) 

GGG 

1 CC ATG GGC ACG ATT CCC AAA CCT CAA AAA AAA AAC AAA CGT 
MGTIPKPQKKNKR 
43 AAC ACC AAC CGT CGC CCA CGG GAC GTC AAG TTC CCG GGT GGC 
NTNRRPRDVKFPGG 
85 GGT CAG ATC GTT GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC 
GQIVGGVYLLPRRG 
127 CCT AGA TTG GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG 
PRLGVRATRKTSER 
169 TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC AAG GCT CGT 
SQPRGRRQPIPKAR 
211 CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG TAC CCT TGG 
RPEGRTWAQPGYPW 
253 CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG CTC 
PLYGNEGCGWAGWL 
295 CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC 
LSPRGSRPSWGPTD 
337 CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC 
PRRRSRNLGKVIDT 
379 CTT ACG TGC GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC 
LTCGFADLMGYIPL 
421 GTC GGC GCC CCT CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT 
VGAPLGGAARALAH 
463 GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA ACA GGG 
GVRVLEDGVNYATG 
505 AAC CTT CCT GGT TGC TCT TTC TCT ATC TTC CTT CTG GCC CTG 
NLPGCSFSIFLLAL 
547 CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC CAA GTG CGC 
LSCLTVPASAYQVR 
589 AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC 
NSTGLYHVTNDCPN 
631 TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 659 
SSIVYEZZGS 

Fig. 8A 



91/15516 



PCT/US91/02370 



10/12 



10 20 30 40 50 



60 



I I I l I I 

AAA 
(wild Type) 

* A J^^SS^SSS^S CT ^^^^ C ^ C GTAACACCAACCGTCGCCCACAG 
qatCCATGGGCACGAATCCTAAAe-> 

Primer C1 

METGlyThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 
70 80 90 100 110 120 

GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG 

aaqcce ataaatGGAGTTTACTTGTTGCC -> 
Primer C100 

<~ TCAGATCG TTGGTGGAGTTTtaataggaatccqg 
primer NC105 

AspValLysPheProGlyGlyGlyGlnIleValGlyGlyValTyrLeuLeuProArgArg 
130 140 150 160 170 180 



GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly 
190 200 210 220 230 240 

I I I I I I 

AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 
250 260 270 280 290 300 

'III!! 

TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 

gaacccatG GGCTGCGGGTGGGCGGG 

Primer C270-> — 

<— CTCTATGGCA AT GAGGGCTa a qqa 1-n nrrrri-r* 

<-Primer *JC270 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 



Fig. 8B 
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310 320 330 340 350 360 

I I I I I I 

l l l I l l 

CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 

qaqcccaTGGGT 
(C360->) 

<-- GGCGTAGGTCGCGCAATTTGtaa 
<-Primer NC360 

gqatccqgcc 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsnLeuGly 

370 380 390 400 410 420 

I I I I l I 

I I I I I I 

AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
AAGGTCATCGATACC 

Primer C360-> 

LysValIleAspThrLeuThrCysGlyPheAlaAspLeuMETGlyTyrIleProLeuVal 
430 440 450 460 470 480 

I ! I ! S 

GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
< — GGAGGCGCTGCCAGGGCCtaaaaatccqqcc 
< — Primer NC450 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 

490 500 510 520 530 540 

I I l l l I 

I I l l l l 

GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTCCTTCTGGCC 
<--GCAACAGGGAACCTTCCTGGTTaaggatccggcc 

GlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerIlePheLeuLeuAla 

550 560 570 580 590 600 

I l I I I I 

I I I I I I 

CTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT 
< — CTGTGCCCGCTTCGGCCTAaqqatccqacc 

LeuLeuSerCysLeuThrValProAlaSerAlaTyrGlnValArgAsnSerThrGlyLeu 

610 620 630 640 650 

TACCACGTCACCAATGATTGCCCTAACTCGAGCATTGTGTACGAGTAATAGGGATCC 

<— GTACGAGTAATAGGGATCCcraa 
< — Primer NC660 
(3' primer) 



TyrHisValThrAsnAspCysProAsnSerSerIleValTyrGlu GlySer 

Fig. 8B (con't) 
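The primer annotations in Fig. 8B can be cross-checked against the template rows with a short script. The sketch below assumes that uppercase letters in a primer are the bases matching the template as printed, and that lowercase letters are added, non-template bases; this convention is inferred from the figure and is not stated in the text. The helper names `split_primer` and `locate_core` are hypothetical. The template is the row labeled 310 to 360 (taken here to start at position 301) and the primer is the legible part of Primer NC360.

```python
def split_primer(primer: str) -> tuple[str, str, str]:
    """Split a primer into (lowercase head, uppercase core, lowercase tail)."""
    upper_positions = [i for i, ch in enumerate(primer) if ch.isupper()]
    if not upper_positions:
        raise ValueError("primer has no uppercase (template-matching) bases")
    first, last = upper_positions[0], upper_positions[-1]
    return primer[:first], primer[first:last + 1], primer[last + 1:]


def locate_core(template: str, primer: str, row_start: int = 1) -> tuple[int, int]:
    """Return the (start, end) positions of the primer core in the template.

    Positions are inclusive and counted from row_start, the position
    assigned to the first base of the template string.
    """
    _, core, _ = split_primer(primer)
    idx = template.find(core.upper())
    if idx < 0:
        raise ValueError("primer core not found in template")
    return row_start + idx, row_start + idx + len(core) - 1


# Template row and primer copied from Fig. 8B.
TEMPLATE = "CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT"
PRIMER_NC360 = "GGCGTAGGTCGCGCAATTTGtaa"

print(split_primer(PRIMER_NC360))                 # ('', 'GGCGTAGGTCGCGCAATTTG', 'taa')
print(locate_core(TEMPLATE, PRIMER_NC360, 301))   # (338, 357)
```

Under these assumptions the 20-base core falls at positions 338 to 357 of the figure's numbering, and the lowercase "taa" tail is consistent with an added stop codon.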
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